EP4132964A2 - Leurres protéiques de novo de l'enzyme 2 de conversion de l'angiotensine (ace2) - Google Patents

Leurres protéiques de novo de l'enzyme 2 de conversion de l'angiotensine (ace2)

Info

Publication number
EP4132964A2
EP4132964A2 EP21721753.8A EP21721753A EP4132964A2 EP 4132964 A2 EP4132964 A2 EP 4132964A2 EP 21721753 A EP21721753 A EP 21721753A EP 4132964 A2 EP4132964 A2 EP 4132964A2
Authority
EP
European Patent Office
Prior art keywords
amino acid
substituted
seq
decoy
ace2
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP21721753.8A
Other languages
German (de)
English (en)
Inventor
Thomas Linsky
Daniel Adriano SILVA MANZANO
Nuria CODINA CASTILLO
Jorgen NELSON
Matthew James Walker
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Neoleukin Therapeutics Inc
Original Assignee
Neoleukin Therapeutics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Neoleukin Therapeutics Inc filed Critical Neoleukin Therapeutics Inc
Publication of EP4132964A2 publication Critical patent/EP4132964A2/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/705Receptors; Cell surface antigens; Cell surface determinants
    • C07K14/70596Molecules with a "CD"-designation not provided for elsewhere
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • A61K38/16Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • A61K38/43Enzymes; Proenzymes; Derivatives thereof
    • A61K38/46Hydrolases (3)
    • A61K38/48Hydrolases (3) acting on peptide bonds (3.4)
    • A61K38/4813Exopeptidases (3.4.11. to 3.4.19)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K45/00Medicinal preparations containing active ingredients not provided for in groups A61K31/00 - A61K41/00
    • A61K45/06Mixtures of active ingredients without chemical characterisation, e.g. antiphlogistics and cardiaca
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/12Antivirals
    • A61P31/14Antivirals for RNA viruses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/48Hydrolases (3) acting on peptide bonds (3.4)
    • C12N9/485Exopeptidases (3.4.11-3.4.19)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y304/00Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
    • C12Y304/17Metallocarboxypeptidases (3.4.17)
    • C12Y304/17023Angiotensin-converting enzyme 2 (3.4.17.23)

Definitions

  • FIELD [002] The present invention is related to de novo protein decoys of ACE2.
  • BACKGROUND [003] Viruses often exploit cell surface-associated proteins to enter and infect host cells. Neutralizing antibodies can inhibit this process by binding to the surface of the virus, impeding its interaction with the target cell’s surface protein(s) and/or preventing viral- associated conformational changes necessary for infection. Vaccination has, therefore, been a broadly useful tool to combat many viral diseases.
  • RNA-based viruses are a frequent exception to this strategy.
  • Neutralizing antibodies bind to viral proteins in a fundamentally different fashion than how the virus interacts with its cell target. Therefore, RNA viruses with high mutational rates often exploit such structural discrepancy to escape the immune system by remodeling the shape of their receptor-binding proteins to evade neutralizing antibodies while retaining the interaction with their target receptor(s).
  • Coronaviruses are large, enveloped, positive-stranded RNA viruses. The genome is packed inside a helical capsid formed by the nucleocapsid protein and is further surrounded by the viral envelope. At least three structural proteins are associated with the viral envelope, including the envelope-anchored spike protein.
  • Coronaviruses recognize a variety of receptors, and binding by the spike protein mediates coronavirus entry into host cells via those receptors. Coronaviruses are believed to enter host cells via a two-step process. First, the spike protein binds to a receptor on the host cell surface through its S1 subunit and then fuses the viral and host membranes through its S2 subunit. Both the viral attachment step and membrane fusion process are mediated by recognition of the host receptor by the spike protein.
  • the alphacoronavirus HCoV-NL63 and the betacoronavirus SARS-CoV both recognize the receptor zinc peptidase angiotensin-converting enzyme 2 (ACE2).
  • SARS-CoV-2 is a highly contagious virus that causes coronavirus disease 2019 (COVID-19). Since its emergence in December 2019, SARS-CoV-2 has caused millions of cases of COVID19 and has become a global pandemic. While the majority of subjects are asymptomatic or have mild disease, a number of subjects will develop either severe disease with dyspnea or hypoxia or critical disease with symptoms of respiratory failure requiring positive pressure ventilation, shock, or multi-organ failure.
  • Figure 1 – Figure 1 is a schematic showing inhibition of viral entry into cells by a de novo designed ACE2 protein decoy.
  • SARS-CoV and SARS-CoV-2 viruses enter cells by first binding to the ACE2 receptor on the surface of human cells via the RBD domain of the spike protein (left).
  • the de novo designed ACE2 decoy binds to the RBD domain in the same manner as natural ACE2 but, unlike ACE2, has no biological function.
  • Figure 2A-F – Figure 2A-F demonstrate by yeast surface display that the de novo ACE2 protein decoy CTC-445 binds to SARS-CoV-2 Spike/RBD Protein.
  • Figure 2A-B show a positive (ACE-2) control and
  • Figure 2C-D show a negative control (human IL-21R).
  • Figure 2E-F show CTC-445.
  • Figure 3A-D - Figure 3A-D demonstrate that control human ACE2 (3A-B) and de novo protein decoy CTC-445 (3C-D) bind to SARS-CoV-2 Spike/RBD Protein and compete with soluble ACE2 for binding to the Spike/RBD protein.
  • Figure 4 – Figure 4 shows the kinetics of binding of purified ACE2 protein decoy CTC-445 via an Octet biolayer interferometry (BLI) binding assay.
  • Figure 5A-H – Figure 5A-H shows the kinetics of binding of select purified ACE2 protein decoys via Octet BLI binding assays.
  • Figure 6A-H – Figure 6A-H shows the kinetics of binding of select purified ACE2 protein decoys via Octet BLI binding assays.
  • Figure 7A-H – Figure 7A-H shows the kinetics of binding of select purified ACE2 protein decoys CTC-445 variants via Octet BLI binding assays.
  • Figure 8A-C Circular dichroism absorption at 222 nm of ACE2 protein decoys CTC-445 (A), CTC-445.2 (B) and CTC-445.2d (C).
  • Figure 9A-C – Figure 9A-C shows the thermal recovery of ACE2 protein decoys CTC-445, CTC-445.2 and CTC-445.2d after repeated cycles of heating and cooling. The data shows that the designed proteins refold even after repeated thermal denaturation.
  • Figure 10A-C – Figure 10A-C show the kinetics of binding of purified ACE2 protein decoys CTC-445 (A), CTC 445.2 (B) and CTC-445.2d (C) via Octet BLI binding assays.
  • Figure 11 – Figure 11 provides a plot of the potency of select ACE2 protein decoys vs their molecular weight. IC 50 is measured by ELISA.
  • Figure 12A-B – Figure 12A shows a neutralization assay performed using a non- replicative VSV pseudovirus carrying a luciferase reporter gene and expressing the spike protein of SARS-CoV-2 on its surface.
  • Viral neutralization with ACE2 protein decoys CTC- 445.2 and CTC-445.2d was performed on HEK 293T cells overexpressing ACE2.
  • the test proteins were pre-incubated with pseudovirus prior to incubation with cells.
  • Samples were tested in duplicate utilizing 3-fold serial dilutions started at 20 ⁇ M (CTC-445.2) or 10 ⁇ M (CTC-445.2d).
  • a cell viability assay (12B) was run in parallel.
  • Figure 13 – Figure 13 shows bioavailability of ACE2 protein decoy CTC-445.2d in mice lung (top) and plasma (bottom) after intranasal administration. Protein concentration in lung lysates and blood plasma are quantified using Meso Scale Discovery platform.
  • Figure 14 – Figure 14 shows ACE2 functional activity as measured by enzymatic release of a free fluorophore from Mca-APK(DNP) substrate. ACE2 inhibition was shown using DX600 peptide as a positive control.
  • Figure 15A-B – Figure 15 shows the kinetics of binding of ACE2 protein decoys CTC-445.2 (A) and CTC-445.2d (B) to SARS-CoV-1 via Octet BLI binding assays.
  • Figure 16A-E – Figure 16A-E show the shows the kinetics of binding of ACE2 protein decoy CTC-445.2 to SARS-CoV-2 RBD mutants via Octet binding assays.
  • Figure 17 – Figure 17 provides the designed ACE2 protein decoy CTC-445 in complex with the SARS-CoV-2 spike protein RBD (surface representation).
  • the graph on the right on the cartoon representations shows biased forward folding simulations of designed sequence.
  • the designed sequence was subjected to ab initio structure prediction using Rosetta.
  • Each point in the plot represents an independent folding trajectory which was computed by Monte Carlo insertion of fragments from solved protein structures. Folding simulations were biased towards the designed conformation by using a small subset of fragments at each residue position with the lowest RMSD (9- and 3-mers) to the designed structure.
  • Red (or black for B&W) dots are trajectories computed using the 5 fragments from Rosetta's vall structural database with lowest RMSD to the designs at each amino acid position.
  • Brown (or gray) dots are trajectories computed using fragments from the design model itself plus the 8 lowest RMSD 3mers and 9mers.
  • the funnel-shaped energy landscape suggests that the designed structure is the global energy minima and has a substantial energy gap with respect to alternative conformations.
  • Figure 18A-B – Figure 18A-B shows a structural alignment of ACE2 protein decoy CTC-640 with a non-redundant database of known structures.
  • Structural alignment was performed using MICAN in rewiring and reverse mode with maximum distance between C ⁇ atoms to be aligned of 10.0 A.
  • Each gray point represents the structural alignment of CTC- 640 with a different structure in the database.
  • Each black point represents the structural alignment of CTC-640 with different structures of ACE2.
  • structural alignments of ⁇ 50 total residues were discarded.
  • Structural alignments are performed using TMalign, which aligns structures based on the order of the secondary structure elements in the polypeptide chain. Sequence identity is computed based on the structurally aligned residues.
  • CTC-640 mimics the ACE2 binding surface, it does not align well in lineal structure or sequence to any protein in the database, including ACE2.
  • Figure 19A-C – Figure 19A-C shows the kinetics of binding of ACE2 protein decoy CTC-708 (CTC-445.2t) via Octet binding assays to SARS-CoV-2 (19A), SARS- CoV-1 (19B), and results of a competition assay
  • Figure 20A-C – Figure 20A-C provides results from deep mutational scanning of ACE2 protein decoy CTC445.2 and plotted as sequence logo using logomaker [ref: https://www.biorxiv.org/content/10.1101/635029v1]. Letters are scaled according to their probability and ordered from highest probability (top) to lowest (bottom). The native sequence of CTC445.2 is shaded in black.
  • FIG. 21A-C - Figure 21A shows neutralization of SARS-CoV-2 infection with ACE2 protein decoys CTC445.2d and CTC445.3d in engineered HEK293T cells overexpressing hACE2 determined using a non-replicative VSV pseudovirus carrying a luciferase reporter gene and expressing the spike protein of SARS-CoV-2 (GenBank: QHD43416.1) on its surface.
  • Figure 21B shows a neutralization assay using control pseudovirus expressing VSVg instead of spike protein.
  • Figure 21C shows cell viability of engineered HEK293T cells incubated with ACE2 protein decoys CTC-445.2d and CTC- 445.3d.
  • the present inventors have built de novo proteins that can accurately recapitulate the natural binding surface targeted by some coronaviruses.
  • the present inventors have built de novo proteins that present a binding surface recognized by coronaviruses, in particular, coronaviruses that use ACE2 to mediate entry into host cells. By binding to the coronavirus, these de novo proteins can act as decoys for host ACE2 protein and, in certain aspects, prevent the virus from binding to its receptor, ACE2.
  • proteins of the present invention are useful, inter alia, for inhibiting or neutralizing the activity of the virus.
  • the proteins are useful for blocking binding of the virus to its host cell receptor and for preventing the entry of the coronavirus into host cells.
  • the proteins function by inhibiting the cell-to-cell transmission of the virus.
  • the proteins are useful in preventing, treating or ameliorating at least one symptom of coronavirus infection in a subject.
  • the proteins may be administered prophylactically or therapeutically to a subject having or at risk of having coronavirus infection.
  • the present invention provides de novo proteins, ACE2 protein decoys, that bind specifically to coronavirus spike protein, in particular, spike protein from those coronaviruses that use ACE2 as their receptor to facilitate viral entry into target cells, for example, SARS-CoV and SARS-CoV-2.
  • de novo proteins, ACE2 protein decoys that block (e.g., partially or fully) coronavirus spike protein binding to its native receptor and, in particular, block coronavirus spike protein binding to ACE2.
  • the present invention provides de novo proteins, ACE2 protein decoys, that block the binding of coronavirus to its native human, camel or bat ACE2 receptor.
  • the de novo proteins of the present invention are non-naturally occurring and are comprised of peptide domains, including at least two alpha helical domains, H1 and H2, and an optional beta hairpin domain, H3. These three domains interface with the coronavirus spike protein.
  • Exemplary de novo proteins of the present invention further comprise at least one structural domain that facilitates protein folding and binding- competent presentation of the H1 and H2 alpha helices and H3 beta hairpin domains to the coronavirus spike protein.
  • exemplary de novo proteins of the present invention comprise at least two structural domains that facilitate protein folding and binding-competent presentation of the H1 and H2 alpha helices and H3 beta hairpin domains to the coronavirus spike protein.
  • the H1 and H2 alpha helical domains and optional beta hairpin comprise amino acid residues that interact with/act as binding sites to the coronavirus spike protein.
  • the de novo proteins of the present invention interact with amino acid residues in the receptor binding domain of coronavirus spike protein.
  • the expected binding residues on the RBD are: 442, 443, 461, 462, 463, 470, 471, 472, 473, 475, 476, 479, 481, 482, 483, 486, 487, 488, 489 and 491 and for SARS-CoV-2, the expected binding residues are: 455, 456, 475, 476, 477, 486, 487, 489, 490, 493, 496, 497, 498, 500, 501, 502, 504, and 505.
  • the present invention provides not only the proteins comprising the peptide domains, H1, H2 and H3, but the peptide domains themselves.
  • the present invention provides nucleic acid molecules encoding the de novo proteins and peptide domains of the present invention.
  • the present invention provides nucleic acid molecules encoding any of the proteins and peptide domains described herein.
  • the present invention provides recombinant expression vectors capable of expressing the proteins of the present invention.
  • the present invention includes recombinant expression vectors comprising any of the nucleic acid molecules mentioned above.
  • host cells into which such vectors have been introduced as well as methods of producing the proteins by culturing the host cells under conditions permitting production of the proteins, and recovering the proteins so produced.
  • the invention provides a pharmaceutical composition comprising a therapeutically effective amount of a de novo ACE2 protein decoy of the present invention and a pharmaceutically acceptable carrier.
  • the invention features a composition which is a combination of a de novo protein of the present invention and a second therapeutic agent.
  • the second therapeutic agent is any agent that is advantageously combined with a de novo protein of the present invention.
  • Exemplary agents include, without limitation, other agents that inhibit viral activity including infectivity of host cells.
  • the invention provides therapeutic methods for treating a disease or disorder associated with a coronavirus that uses ACE2 as its receptor to facilitate viral entry into target cells.
  • the methods include, for example, treating a viral infection in a subject using de novo ACE2 protein decoy of the invention, wherein the therapeutic methods comprise administering a therapeutically effective amount of a pharmaceutical composition comprising the ACE2 protein decoy to the subject in need thereof.
  • the disorder treated is any disease or condition which is improved, ameliorated, inhibited or prevented by inhibition of SARS-CoV or SARS-CoV-2 coronavirus activity or activity of any other coronavirus that gains access to its target cells using the ACE2 receptor.
  • the present invention provides methods to prevent, treat or ameliorate at least one symptom of coronavirus infection, the method comprising administering a therapeutically effective amount of a protein of the present invention to a subject in need thereof.
  • the present invention provides methods to ameliorate or reduce the severity of at least one symptom or indication of coronavirus infection in a subject by administering a protein of the invention, wherein at least one symptom or indication is selected from the group consisting of: inflammation in the lung, alveolar damage, fever, cough, shortness of breath, diarrhea, heart failure, arrhythmias, multiple organ dysfunction, pneumonia, septic shock and/or death.
  • the invention provides methods to decrease viral load in a subject, the methods comprising administering to the subject an effective amount of a ACE2 protein decoy of the invention that binds the coronavirus spike protein from SARS-CoV or SARS-CoV-2 or another coronavirus spike protein from a coronavirus that gains entry into its target cells by use of the ACE2 receptor and blocks binding of the spike protein to its host cell receptor.
  • the de novo protein may be administered prophylactically or therapeutically to a subject having or at risk of having a coronavirus infection.
  • the subjects at risk include, but are not limited to, an immunocompromised person, an elderly adult (more than 65 years of age), healthcare workers, adults or children in close contact with a person(s) with confirmed or suspected coronavirus infection, and people with underlying medical conditions such as pulmonary disease or infection, heart disease or diabetes.
  • the de novo proteins of the present invention are administered in combination with a second therapeutic agent to a subject in need thereof.
  • the second therapeutic agent may be, for example, selected from the group consisting of an anti-inflammatory drug (such as corticosteroids, and non-steroidal anti-inflammatory drugs), an anti-infective drug, or an anti-viral drug.
  • the second therapeutic agent may be an agent that helps to counteract or reduce any possible side effect(s) associated with a de novo protein of the invention, if such side effect(s) should occur.
  • the second therapeutic is one to treat cytokine release syndrome (e.g., a cytokine storm).
  • the de novo protein thereof may be administered, for example, subcutaneously, intravenously, intradermally, intraperitoneally, orally, or intramuscularly. In some aspects, the de novo proteins are inhaled.
  • Also included in the present invention are methods for treating a viral infection in a subject using the nucleic acids of the invention.
  • the methods comprise administering a therapeutically effective amount of a nucleic acid encoding an ACE2 protein decoy of the present invention to a subject in need thereof.
  • the present invention also includes use of protein or nucleic acid of the invention in the manufacture of a medicament for the treatment of a disease or disorder that would benefit from the blockade of coronavirus binding and/or activity.
  • the invention provides methods for detecting coronavirus spike protein in a biological sample. The methods comprise the steps of contacting the biological sample with an ACE2 protein decoy of the present invention and detecting coronavirus spike protein in the biological sample.
  • SARS-CoV refers to the Severe Acute Respiratory Syndrome coronavirus that emerged in China in 2002. It binds via the viral spike protein to human host cell receptor ACE2.
  • SARS-CoV-S also called “S protein”, refers to the spike protein of the SARS coronavirus (S1 and S2).
  • SARS-CoV spike protein mediates receptor recognition and membrane fusion.
  • S1 and S2 The receptor binding domain is found in the S1 subunit and it directly binds to the peptidase domain (PD) of ACE2.
  • S2 is responsible for membrane fusion.
  • SARS-CoV-S protein When S1 binds to ACE2, S2 is cleaved by host proteases. This cleavage of S2 is believed to be critical for viral infection.
  • An exemplary SARS-CoV-S protein is provided herein as SEQ ID NO:2.
  • the signal peptide is amino acids 1-13
  • S1 spike protein is amino acids 14-667
  • S2 spike protein is amino acids 668-1255.
  • SARS-CoV-S and SARS-CoV spike protein includes protein variants of SARS-CoV spike protein isolated from different SARS-CoV isolates.
  • SARS-CoV isolates include, for example, Isolate BJ01, Isolate BJ02, Isolate BJ03, Isolate BJ04, Isolate GZ50, Isolate CUHK-W1, Isolate HKU- 36871, Isolate GD01, Isolate GD03, Isolate Shanghai LY, Isolate Frankfurt 1, Isolate FRA, Isolate SZ23, Isolate SZ3, and Isoalte Tor2.
  • COVID-19” or “2019-nCov” or “SARS-CoV-2” refers to a new SARS- like coronavirus that emerged from Wuhan, China in 2019 and was labeled by the WHO as a pandemic on March 11, 2020.
  • SARS-CoV-2-S also called “S protein” refers to the spike protein of the SARS-CoV-2 coronavirus.
  • An exemplary SARS-CoV-2 S protein is provided herein as SEQ ID NO:3.
  • SARS-CoV-2-S and SARS-CoV-2 spike protein includes protein variants of SARS-CoV-2 spike protein isolated from different SARS-CoV-2 isolates.
  • coronavirus ACE2-binding spike protein or “ACE2- binding spike protein” as used herein refers to a coronavirus spike protein that uses ACE2 to mediate its entry into host cells.
  • ACE2 refers to the angiotensin-converting enzyme 2 that acts as a receptor for select coronaviruses.
  • the full length of the human ACE2 protein is provided herein as SEQ ID NO:1.
  • the signal peptide is amino acid residues 1-17; the extracellular PD domain is amino acid residues 18-740; the transmembrane segment is residues 741-761; and the intracellular domain is residues 762-805.
  • the primary physiological role of ACE2 is in the maturation of angiotensin, however, it has also been hijacked as a cellular receptor for some coronaviruses.
  • coronavirus infection or “SARS-CoV infection” or “SARS-CoV-2 infection”, as used herein, refers to infection by a coronavirus that use the ACE2 receptor to gain entry into host cells, and in particular, SARS or COVID-19 coronavirus. Severe acute respiratory illness is associated with both SARS and COVID-19 coronavirus infection. A wide range of additional complications are associated with COVID-19 coronavirus, including thrombotic events and neurological disease.
  • Symptoms of infection include fever, cough, shortness of breath pneumonia, gastro-intestinal symptoms such as diarrhea, organ failure (kidney failure, heart failure, and renal dysfunction), septic shock and death in severe cases.
  • recombinant refers to proteins of the invention created, expressed, isolated or obtained by technologies or methods known in the art as recombinant DNA technology which include, e.g., DNA splicing and transgenic expression.
  • a first moiety specifically binds a second moiety and the resulting complex is relatively stable under physiologic conditions.
  • Specific binding can be characterized, in some embodiments, by an equilibrium dissociation constant of about 650 nM or less, or in some embodiments, 100 nM or less (e.g., a smaller KD denotes a tighter binding).
  • Methods for determining whether two moieties specifically bind are well known in the art and include, for example, equilibrium dialysis, surface plasmon resonance, and the like. As described herein, proteins have been identified which bind specifically to coronavirus spike protein, in particular, proteins that bind specifically to coronavirus ACE2-binding spike protein.
  • the phrase “therapeutically effective amount” refers to an amount that produces the desired effect for which it is administered. The exact amount will depend on the purpose of the treatment and will be ascertainable by one skilled in the art using known techniques (see, for example, Lloyd (1999) The Art, Science and Technology of Pharmaceutical Compounding).
  • the term “subject” refers to an animal, preferably a mammal, more preferably a human, in need of amelioration, prevention and/or treatment of a disease or disorder such as viral infection. The term includes human subjects who have or are at risk of having a coronavirus infection.
  • the terms “treat”, “treating”, or “treatment” refer to the reduction or amelioration of the severity of at least one symptom or indication of a coronavirus infection due to the administration of a therapeutic agent such as a protein of the present invention to a subject in need thereof.
  • the terms include inhibition of progression of disease or of worsening of infection.
  • the therapeutic agent may be administered at a therapeutic dose to the subject.
  • the term “prevent”, “preventing” or “prevention” refers to inhibition of the onset of symptoms of a coronavirus infection. In some embodiments, prevention encompasses inhibition of a coronavirus infection and/or inhibition of the spread of coronavirus infection from a subject to another individual.
  • anti-viral drug refers to any anti-infective drug or therapy used to treat, prevent, or ameliorate a viral infection in a subject.
  • anti- viral drug includes, but is not limited to ribavirin, remdesivir, oseltamivir, zanamivir, interferon-alpha2b, analgesics and corticosteroids.
  • the viral infections include infection caused by human coronaviruses, including SARS-CoV and SARS-CoV-2.
  • identity refers to the amino acid sequence identity between two molecules.
  • Sequence identity can be measured, for example, using sequence analysis software such as the Sequence Analysis Software Package of the Genetics Computer Group at the University of Wisconsin Biotechnology Center (1710 University Avenue, Madison, WI 53705), with the default parameters thereof.
  • sequence analysis software such as the Sequence Analysis Software Package of the Genetics Computer Group at the University of Wisconsin Biotechnology Center (1710 University Avenue, Madison, WI 53705), with the default parameters thereof.
  • identity it is also important to consider positioning of the binding interface residues with the coronavirus spike protein. If amino acids are added or deleted, it should be done in such a way that doesn’t substantially interfere with presentation of the protein to its binding partner or with secondary structure. Unless indicated otherwise, percent identity is determined across the length of the reference sequence.
  • amino acid substitutions relative to the reference peptide domains can be, for example, conservative amino acid substitutions.
  • conservative amino acid substitution means a given amino acid can be replaced by an amino acid having similar physiochemical characteristics, e.g., substituting one aliphatic residue for another (such as Ile, Val, Leu, or Ala for one another), or substitution of one polar residue for another (such as between Lys and Arg; Glu and Asp; or Gln and Asn). Other such conservative substitutions, e.g., substitutions of entire regions having similar hydrophobicity characteristics, are known. Polypeptides comprising conservative amino acid substitutions can be tested in any one of the assays described herein to confirm that a desired activity is retained. Amino acids can be grouped according to similarities in the properties of their side chains.
  • Naturally occurring residues can be divided into groups based on common side-chain properties.
  • Non-conservative substitutions will entail exchanging a member of one of these classes for a member of another class.
  • Particular conservative substitutions include, for example; Ala to Gly or Ser; Arg to Lys; Asn to Gln or H; Asp to Glu; Cys to Ser; Gln to Asn; Glu to Asp; Gly to Ala or Pro; His to Asn or Gln; Ile to Leu or Val; Leu to Ile or Val; Lys to Arg, Gln or Glu; Met to Leu, Tyr or Ile; Phe to Met, Leu or Tyr; Ser to Thr; Thr to Ser; Trp to Tyr; Tyr to Trp; and/or Phe to Val, Ile or Leu.
  • a common hydrophobic grouping is glycine (Gly), alanine (Ala), valine (Val), leucine (Leu), isoleucine (Ile), proline (Pro), and phenylalanine (Phe).
  • the natural amino acid residues are abbreviated as follows: alanine (Ala; A), asparagine (Asn; N), aspartic acid (Asp; D), arginine (Arg; R), cysteine (Cys; C), glutamic acid (Glu; E), glutamine (Gln; Q), glycine (Gly; G), histidine (His; H), isoleucine (Ile; I), leucine (Leu; L), lysine (Lys; K), methionine (Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S), threonine (Thr; T), tryptophan (T
  • any amino acid typically refers to the 20 natural amino acids. The skilled practitioner will appreciate, however, that one or more, (e.g., from 1 to 10, 1 to 5, 1 to 3, or 1 or 2) unnatural amino acids can be used in place of a natural amino acid.
  • unnatural amino acid refers to an amino acid other than the 20 amino acids that occur naturally in protein. Unnatural amino acids are known in the art.
  • polypeptide As used herein, the terms “polypeptide”, “protein” or “peptide” refer to any chain of amino acid residues, regardless of its length or post-translational modification (e.g., glycosylation or phosphorylation).
  • “Operably linked” is intended to mean that the nucleotide sequence of is linked to the regulatory sequence(s) in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).
  • “Regulatory sequences” include promoters, enhancers, and other expression control elements (e.g., polyadenylation signals).
  • the expression constructs of the invention can be introduced into host cells to thereby produce the proteins disclosed herein.
  • the terms “host cell” and “recombinant host cell” are used interchangeably herein.
  • transformation and “transfection” refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, particle gun, or electroporation.
  • the term “pharmaceutically acceptable carrier” includes, but is not limited to, saline, solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, compatible with pharmaceutical administration. Supplementary active compounds (e.g., antibiotics) can also be incorporated into the compositions.
  • GENERAL [0060] The inventors have described herein de novo ACE2 protein decoys that specifically bind to the spike protein on coronaviruses and modulate the interaction of the spike proteins with their innate receptor.
  • the de novo ACE2 protein decoys bind to the spike protein on those coronaviruses that use ACE2 as their receptor to facilitate viral entry into target cells.
  • the de novo ACE2 protein decoys of the present invention contain the domains necessary for interacting with and binding to coronavirus spike protein, they typically do not contain domains associated with other enzymatic activities of native ACE2, including for example the catalytic domain (e.g., metalloprotease catalytic domain) of ACE2.
  • Exemplary de novo ACE2 protein decoys of the present invention do not catalyze the cleavage of angiotensin (i.e., any forms of angiotensin, including angiotensin I and II).
  • the ACE2 protein decoys provided herein have no more than 60%, 55%, 50%, 45%, 40%, or 35% sequence identity to human ACE2 (SEQ ID NO: 1). With regard to identity to ACE2, percent identity is calculated with the ACE2 protein decoy as query and ACE2 as reference, over the length of the query.
  • protein decoys of the present invention bind to the spike protein (e.g., spike protein from SARS-CoV-2) with a Kd of 700 nM or less, Kd of 600 nM or less, Kd of 500 nM or less, Kd of 400 nM or less, Kd of 300 nM or less, Kd of 200 nM or less, K d of 100 nM or less, K d of 50 nM or less, preferably about 20 nM or less, or even 15 nM, 10 nM, or 5 nM or less.
  • Methods of determining K d are known in the art and described in the examples.
  • the de novo protein decoys of the present invention are blocking proteins in that they may bind to the coronavirus spike protein (e.g., from SARS-CoV-2) and block the interaction of the spike protein with their native receptor (i.e., ACE2).
  • blocking proteins may completely block the interaction of the spike protein with their native receptor or may partially block the interaction of the spike protein with their native receptor.
  • the de novo protein decoys inhibit the interaction of the spike protein with their native receptor (i.e., ACE2) with an IC 50 of 100 nM or less.
  • the de novo proteins inhibit the interaction of the spike protein with their native receptor (i.e., ACE2) with an IC 50 of 50, 40, 30, 20, 10, or 5 nM or less.
  • the blocking proteins of the invention block the binding of the coronavirus spike protein to its receptor and/or inhibit or neutralize or reduce viral infectivity of host cells.
  • the blocking proteins may be useful for treating a subject suffering from a coronavirus infection (e.g., COVID-19 coronavirus infection).
  • the de novo protein decoys of the present invention when administered to a subject in need thereof, reduce the infection by a coronavirus such as SARS-CoV-2 in the subject.
  • Protein decoys of the present invention may be used alone or as adjunct therapy with other therapeutic moieties or modalities known in the art for treating viral infection.
  • the ability of the de novo proteins of the invention to bind to and inhibit/neutralize the activity of SARS-CoV or SARS-CoV-2 may be measured using any standard method known to those skilled in the art, including binding assays, or activity assays, as described herein. For example, in vitro assays for measuring binding and inhibition and/or blocking activity are illustrated in examples.
  • the de novo protein decoys of the present invention may contain no additional labels or moieties, or they may contain labels (e.g., an N-terminal or C-terminal label or moiety).
  • the label or moiety is biotin.
  • the location of a label may determine the orientation of the peptide relative to the surface upon which the peptide is bound. For example, if a surface is coated with avidin, a peptide containing an N-terminal biotin will be oriented such that the C-terminal portion of the peptide will be distal to the surface.
  • the label may be, for example, an enzyme, a radionuclide, a fluorescent dye or a MRI-detectable label.
  • Such labeled proteins may be used in diagnostic assays including imaging assays.
  • the label may be a C-terminal peptide that allows for detection of protein by absorbance at 280 nm (e.g., GSGWGSG, SEQ ID NO:248).
  • STRUCTURAL AND SEQUENCE CHARACTERISTICS OF THE ACE2 PROTEIN DECOYS [0063] Provided herein are de novo proteins of the present invention, referred to herein as ACE2 protein decoys.
  • These protein decoys are, by nature, non-naturally occurring proteins, and comprise at least two alpha helical domains that interface with the coronavirus spike protein and preferably at least one structural domain that facilitates protein folding and binding-competent presentation of the alpha helices.
  • the de novo proteins further comprise an optional beta hairpin domain.
  • the alpha helical domains that interface with the coronavirus spike protein and optional beta-hairpin domain interact with/act as binding sites to the coronavirus spike protein.
  • These domains referred to herein as H1, H2, and H3, comprise both amino acid residues that engage in binding interactions with the coronavirus spike protein and amino acid residues that do not engage in binding interactions with the coronavirus spike protein.
  • those amino acid residues that do not engage in binding interactions with the coronavirus ACE2- binding spike protein are at positions that can be very promiscuous with respect to the identity of the amino acid that sits at that position. A number of these residues are also at solvent exposed positions. In some embodiments, when replacing amino acids at solvent exposed positions, the use of hydrophilic amino acids are particularly desirable, although non-hydrophilic amino acids are acceptable as well.
  • the de novo proteins of the present invention were designed such that the binding domains align structurally to the corresponding binding sites in the native ACE2 protein whereas the supporting structural domains do not structurally or sequentially align to any other secondary structures in ACE2.
  • the de novo proteins of the present invention structurally align to the native ACE2 binding motifs within, for example, 2.75 A RMSD (root mean square deviation) and contain one or more secondary structure elements that do not structurally or sequentially align to any other secondary structure elements in ACE2.
  • Methods of determining RMSD are known in the art, for example using the MICAN protein structure alignment algorithm.
  • the de novo proteins of the present invention comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H1 comprises the amino acid sequence SX 1 X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:4) or X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:176); H2 comprises the amino acid sequence NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5); and H3, if present, comprises the amino acid sequence X 28
  • Such de novo proteins further comprise one or more structural domain that facilitates protein folding and binding-competent presentation of H1, H2, and H3.
  • the coronavirus-binding amino acid residues of the ACE2 protein decoys are the same as the coronavirus binding amino acid residues of the ACE2 protein.
  • the binding residues of H1, H2 and H3 are identical to the amino acid at the same structural position in the native ACE2 protein.
  • all, or all but one to six, or one to five, or one to four, or one to three of the solvent exposed amino acids of H1 and all, or all but one to six, or one to five, or one to four, or one to three of the solvent exposed amino acids of H2, whether or not they are involved in binding, are the same as the amino acids at the same structural position in the native ACE2 protein.
  • amino acid residues at positions 1, 5, 6, 9, 10, 12, 13, 16, 17, 19, 20, 23, 24 and 27 are, for the most part, solvent exposed and/or involved in binding to coronavirus ACE2-binding spike protein; in SEQ ID NO: 176, the amino acid residues at positions 3, 4, 7, 8, 10, 11, 14, 15, 17, 18, 21, 22 and 25 are, for the most part, solvent exposed and/or involved in binding to coronavirus ACE2-binding spike protein; in SEQ ID NO:5, the amino acid residues at positions 1, 4, 8, 12, 15, 16, 19, 22 and 23 are, for the most part, solvent exposed and/or involved in binding to coronavirus ACE2-binding spike protein; and in SEQ ID NO:6 the amino acid residues at positions 7, 8, and 9 are, for the most part, solvent exposed and/or involved in binding to coronavirus ACE2-binding spike protein.
  • these residues are not substituted. In other aspects, it may be desirable to modify these residues. For example, modifications to these residues can be made in order to create a protein that binds to ACE2 with a higher affinity than native coronavirus spike protein. Such a protein can be used, for example, for diagnostic purposes.
  • no more than 4, no more than 3, no more than 2 or no more than 1 of the residues at positions 1, 5, 6, 9, 10, 12, 13, 16, 17, 19, 20, 23, 24 and 27 of SEQ ID NO:4 are substituted or no more than 4, no more than 3, no more than 2 or no more than 1 of the residues at positions 3, 4, 7, 8, 10, 11, 14, 15, 17, 18, 21, 22 and 25 of SEQ ID NO:176 are substituted; no more than 3, no more than 2 or no more than 1 of the residues at positions 1, 4, 8, 12, 15, 16, 19, 22 and 23 of SEQ ID NO:5, are substituted; and no more than 1 of the residues of 7, 8, 9 or 11 in SEQ ID NO:6 is substituted or no more than 1 of the residues of 7, 8, or 9 in SEQ ID NO:6 is substituted.
  • substitutions are with amino acids selected from D, E, G, K, N, P, Q, R, S, or T. In some such aspects, substitutions are with conservative amino acids. In other aspects, one or more of the following substitutions are made in SEQ ID NO: 4: S1I; E5D; E5Q; E5V; D12V, D12E; Q24K; and Q24L. In other aspects, one or more of the following substitutions are made in SEQ ID NO: 176: E3D; E3Q; E3V; D10V, D10E; Q22K; and Q22L.
  • residues identified in SEQ ID NOs: 4, 176, 5, and 6 as X are not solvent exposed and/or are not directly involved in binding to coronavirus ACE2-binding spike protein. These residues typically may be different from the corresponding amino acids in ACE2. Included in the present invention are those proteins wherein not more than half of, or no more than 8 of, the amino acids represented as X (with a subscript numeral) are the same amino acid as the corresponding position in native ACE2 represented by SEQ ID NO:1.
  • not more than 4, 3 or 2, of the amino acids represented as X (with a subscript numeral) in H1; not more than 4, 3 or 2, of the amino acids represented as X (with a subscript numeral) in H2; and/or not more than 5, 4, 3 or 2, of the amino acids represented as X (with a subscript numeral) in H3, are the same as the corresponding position in native ACE2.
  • An exemplary corresponding sequence of H1 in ACE2 is ST IEEQAKTFLD KFNHEAEDLF YQSSL (SEQ ID NO:7); an exemplary corresponding sequence of H2 in ACE2 is NMNNAGDKWS AFLKEQSTLA QMY (SEQ ID NO:8); an exemplary corresponding sequence of H3 in ACE2 is TAWD LGKGDFRIL (SEQ ID NO:9).
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein: H1 comprises the amino acid sequence: SX 1 X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:4) or X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:176); wherein X 1 , X 2 , X 3 , X 4 , X 5 , X 6 , X 7 , X 8 , X 9 , X 10 , X 11 , X 12 , X 13 are each independently selected from any amino acid
  • H1 comprises the amino acid sequence: SX 1 X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:4) or X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:176); wherein: X 1 is an amino acid selected from A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably A, C, E, F, G, I, L, M, N, Q, R, S, T, V, or Y); X 2 is an amino acid selected from A, C, D, E, F, G, H, I, K, L, M, N, N, Q, R, S, T, V, or Y); X 2
  • De novo ACE2 protein decoys of the present invention include those above wherein phenylalanine is included in the list of amino acids for X 1 , asparagine is included in the list of amino acids for X 2 , histidine is included in the list of amino acids for X 8 , and asparagineis included in the list of amino acids for X 11.
  • De novo ACE2 protein decoys of the present invention include those above wherein phenylalanine is included in the list of amino acids for X 1 , asparagine is included in the list of amino acids for X 2 , asparagine is removed from the list of amino acid substitutions for X 3 , X 4 is an amino acid selected from I, L, M, P, V, or T; aspartic acid is removed from the list of amino acid substitutions for X 5 , X 6 is an amino acid selected from A, L, M, or V; proline is removed from the list of amino acid substitutions for X 7 , histidine is included in the list of amino acids for X 8 and lysine and proline are removed, proline is removed from the list of amino acids for X 9 , asparagine is included in the list of amino acids for X 11 and alanine, isoleucine, proline and valine are removed, and proline is removed from the list of amino acid substitutions for X 12 and X
  • X 4 is an amino acid selected from A, F, I, L, M, P or V (preferably I, L, M or V); and X 1 , X 2, X 3, X 5, X 6, X 7, X 8, X 9, X 10, X 11, X 12, and X 13 are as described in any of the embodiments provided herein;
  • X 4 is an amino acid selected from A, F, I, L, M, P or V (preferably I, L, M or V);
  • X 5 is an amino acid selected from D, E, G, K, N, P, Q, R, S, or T (preferably E, G, K, N, P, Q, R, S, or T); and X 1 , X 2 ,X 3 , X 6 , X 7 , X 8 ,X 9 , X 10 ,X 11 ,X 12 , and X 13 are
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein: H1 comprises the amino acid sequence: SX 1 X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:4) or X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:176); wherein: X 1 and X 5 are each independently an amino acid selected from D, E, G, K, N, P, Q, R, S, or T; X 2, X 4, X 6, X 9, X 11, X 12, and X 13, are each independently an amino acid selected from A, F, I, L, M, P or V; X 3, X 7, and X 10, are each independently an amino acid selected from A
  • De novo proteins of the present invention include those above wherein phenylalanine is included in the list of amino acids for X 1 , asparagine is included in the list of amino acids for X 2 , histidine is included in the list of amino acids for X 8 , and asparagine is included in the list of amino acids for X 11.
  • De novo proteins of the present invention include those above wherein phenylalanine is included in the list of amino acids for X 1 , asparagine is included in the list of amino acids for X 2 , asparagine is removed from the list of amino acid substititions for X 3 , X 4 is an amino acid selected from I, L, M, P, or V; asparatic acid is removed from the list of amino acid substitions for X 5 , X 6 is an amino acid selected from A, L, M, or V; proline is removed from the list of amino acid substitions for X 7 , histidine is included in the list of amino acids for X 8 and lysine and proline are removed; proline is removed from the list of amino acids for X 9 , asparagine is included in the list of amino acids for X 11 and alanine, isoleucine, proline and valine are removed, and proline is removed from the list of amino acid substitions for X 12 and X 13.
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H1 comprises the amino acid sequence SX 1 X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:4) or X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:176); wherein: X 1 is an amino acid selected from R or S; X 3 is an amino acid selected from R or L; X 7 is an amino acid selected from A or T; X 10 is an amino acid selected from R or S or L (preferably R or S); X 2, X 4, X 6, X 9, X 11, X 12, and X 13, are each independently an amino acid selected from A, F, I, L, M
  • De novo proteins of the present invention include those above wherein phenylalanine is included in the list of amino acids for X 1 , asparagine is included in the list of amino acids for X 2 , histidine is included in the list of amino acids for X 8 , and asparagine is included in the list of amino acids for X 11.
  • De novo proteins of the present invention include those above wherein phenylalanine is included in the list of amino acids for X 1 , asparagine is included in the list of amino acids for X 2 , asparagine is removed from the list of amino acid substititions for X 3 , X 4 is an amino acid selected from I, L, M, P, or V; asparatic acid is removed from the list of amino acid substitions for X 5 , X 6 is an amino acid selected from A, L, M, or V; proline is removed from the list of amino acid substitions for X 7 , histidine is included in the list of amino acids for X 8 and lysine and proline are removed; proline is removed from the list of amino acids for X 9 , asparagine is included in the list of amino acids for X 11 and alanine,isoleucine, proline and valine are removed, and proline is removed from the list of amino acid substitions for X 12 and X 13.
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H1 comprises the amino acid sequence SX 1 X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:4) or X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:176); wherein: X 1 is an amino acid selected from R or S; X 2 is V; X 3 is an amino acid selected from R or L; X 4 is leucine; X 5 is lysine; X 6 is alanine; X 7 is an amino acid selected from A or T; X 8 is phenylalanine; X 9 is methionine; X 10 is an amino acid sequence SX
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H1 comprises the amino acid sequence SX 1 X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:4) or X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:176); wherein: X 1 is an amino acid selected from R or S; X 2 is valine; X 3 is an amino acid selected from R or L or C or S; X 4 is leucine; X 5 is lysine; X 6 is alanine; X 7 is an amino acid selected from A or T; X 8 is phenylalanine; X 9 is M or L; X 10
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H1 comprises the amino acid sequence SX 1 X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:4) or X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:176); wherein: X 1 is an amino acid selected from R or S or F; X 2 is an amino acid selected from N or V; X 3 is an amino acid selected from R or L or C or S; X 4 is leucine; X 5 is lysine; X 6 is alanine; X 7 is an amino acid selected from A or T; X 8 is an amino acid selected from F or H;
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein: H1 comprises the amino acid sequence: SX 1 X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:4) or X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L (SEQ ID NO:176); wherein (i) X 8 is not valine or (ii) X 8 is not an an amino acid selected from V, A, I, L, M, or P or (iii) X 8 is phenylalanine; and X 1, X 2, X 3, X 4, X 5, X 6, X 7, X 9, X 10, X 11, X 12, and X 13 are as described here
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H1 comprises the amino acid sequence: SX 1 X 2 X 3 EQX 4 X 5 TFX 6 DKX 7 X 8 HEX 9 EDX 10 X 11 YQX 12 X 13 L X 37 X 38 X 39 X 40 X 41 X 42 X 43 X 44 X 45 X 46 (SEQ ID NO:10); wherein X 1 - X 13 are as provided in the any of the embodiments herein for H1; and X 37 ,X 38 , X 41 , X 42 , X 45 , and X 46 are each independently selected from any amino acid.
  • X 1 - X 13 are as provided in the any of the embodiments herein for H1; and X 37 , X 38 , X 41 , X 42 , X 45 , and X 46 are each independently an amino acid selected from A, F, I, L, M, P or V; X 40 and X 44 are each independently an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; X 39 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably A, F, I, L, M, P or V); X 43 is an amino acid selected from D, E, G, K, N, P, Q, R, S, or T.
  • De novo proteins of the present invention include those above wherein isoleucine is removed from the list of amino acids for X 37 , proline is removed from the list of amino acids for X 38 ;
  • X 39 is an amino acid selected from F, I, L, M, V, W, or Y;
  • histidine is included in the list of amino acids for X 40 and proline is removed, proline is removed from the list of amino acids for X 41 ;
  • the amino acids, T and Y are included in the list of amino acids for X 42 .
  • X 42 is an amino acid selected from A or V.
  • ACE2 protein decoys wherein X 1 , X 2 , X 3 , X 4 , X5, X 6 , X 7 , X 8 , X 9 , X 10 , X 11 , X 12 , X 13 are as described in any of the embodiments herein and one or more (preferably not more than 1 to 3) of the amino acids at positions 1, 5, 6, 9, 10, 12, 13, 16, 17, 19, 20, 23, 24 and 27 of SEQ ID NO:4 are substituted (or the equivalent positions in SEQ ID NO:176).
  • substitutions are made: S1I; E5D; E5Q; E5V; D12V, D12E; Q24L; and Q24K.
  • one of the following substitutions is made in SEQ ID NO:4: S1I; E5D; E5Q; E5V; D12V, D12E; Q24L; and Q24K (or the equivalent positions in SEQ ID NO:176).
  • one or more of the following substitutions are made in SEQ ID NO:4: T9F, D12I, D12N, E17I, Y23H or the equivalent positions in SEQ ID NO:176).
  • de novo proteins of the present invention include those described herein provided that X 4 is not an amino acid selected from D, E, G, H, K, N, P, Q, R, S, W, or Y; X 6 is not an amino acid selected from D, E, F, H, K, P, Q, R, W, or Y; X 7 is not P; X 8 is not proline; X 9 is not an amino acid selected from K, P, or R; X 11 is not an amino acid selected from D E, K, P, R, T or V; X 12 is not an amino acid selected from K, P, or R; X 13 is not an amino acid selected from P or W; X 38 is not an amino acid selected from H, K, P, or R;X 39 is not an amino acid selected from D, E, G, K, or P; and/or X 41 is not proline.
  • ACE2 protein decoys of the present invention include those wherein H1 comprises an amino acid sequence having at least 70%, 80%, 90%, 95% or 100% identity to an amino acid sequence set forth in SEQ ID NO:11-17, 177-183, 198, or 199: SRVLEQLKTFADKAFHEMEDRFYQAAL (SEQ ID NO:11) SSVREQLKTFADKAFHEMEDRFYQAAL (SEQ ID NO:12) ⁇ 44 SRVREQLKTFADKTFHEMEDRFYQAAL (SEQ ID NO:13) SRVREQLKTFADKAFHEMEDSFYQAAL (SEQ ID NO:14) SSVLEQLKTFADKAFHEMEDRFYQAAL (SEQ ID NO:15) SSVLEQLKTFADKTFHEMEDSFYQAAL (SEQ ID NO:16) SRVREQLKTFADKAFHEMEDRFYQAAL (SEQ ID NO:17) VLEQLKTFADKAFHEMEDRFYQAAL (SEQ ID NO:177
  • H1 comprises additional amino acids at the C terminuts (e.g., an additional 11 amino acids at the C terminus).
  • H1 herein (including H1 having an amino acid sequence having at least 70%, 80%, 90%, 95% or 100% identity to an amino acid sequence set forth in SEQ ID NO:11-17, 177-183, 198, or 199), the amino acid sequence AVFEAAEAAAG (SEQ ID NO:18), AVWEAAEAAAG (SEQ ID NO:19), AVFEAVEAAAG (SEQ ID NO:249) or AVWEAVEAAAG (SEQ ID NO:250) can be optionally present at the C terminus.
  • H1 comprises a sequence having at least 70%, 80%, 90%, 95% or 100% identity to the sequence set forth in SEQ ID NOS.240-243 or 251-254: SSVLEQLKTFADKAFHEMEDRFYQAALAVFEAAEAAAG (SEQ ID NO:240), VLEQLKTFADKAFHEMEDRFYQAALAVFEAAEAAAG (SEQ ID NO:241); SSVLEQLKTFADKAFHEMEDLFYQAALAVFEAAEAAAG (SEQ ID NO:242); VLEQLKTFADKAFHEMEDLFYQAALAVFEAAEAAAG (SEQ ID NO:243); SSVLEQLKTFADKAFHEMEDRFYQAALAVFEAVEAAAG (SEQ ID NO:251), VLEQLKTFADKAFHEMEDRFYQAALAVFEAVEAAAG (SEQ ID NO:252); SSVLEQLKTFADKAFHEMEDLFYQAALAVFEAVEAAAG (SEQ ID NO:253); or
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5) wherein X 14 , X 15 , X 16 , X 17 , X 18 , X 19 , X 20 , X 21 , X 22 , X 23 , X 24 , X 25 , X 26 , X 27 are each independently selected from any amino acid.
  • H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5)
  • X 14 is an amino acid selected from A, C, D, G, H, I, L, M, N, P, R, S, T, V, or W (preferably A, C, G, P, T, or V)
  • X 15 is an amino acid selected from A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably A, C, D, E, F, I, K, L, M, N, Q, R, S, T, or V)
  • X 16 is an amino acid selected from A, C, D, E, F, G, H, I, K, L, M, M, M,
  • H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5) wherein X 15 , X 18 , X 21 , X 23 , X 25 , and X 27 are each independently an amino acid selected from D, E, G, K, N, P, Q, R, S, or T; X 14 , X 16 , X 17 , X 19 , X 22 , X 24 , and X 26 are each independently an amino acid selected from A, F, I, L, M, P or V; and X 20 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y.
  • De novo proteins include those above wherein phenylalanine is removed from the list of amino acids for X 14 and X 17 , tyrosine is included in the list of amino acids for X 18 , glutamic acid and proline are removed from the list of amino acids for X 19 , proline is removed from the list of amino acids for X 20 , X 22 , X 23 , X 24 and X 25 , methionine is included in the list of amino acids for X 21 , histidine is included in the list of amino acids for X 23, methionine and proline are removed from the list of amino acids for X 26 , and isoleucine is included in the list of amino acids for X 27 and lysine and proline are removed.
  • H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5) wherein X 15 , X 18 , X 21 , X 23 , X 25 , and X 27 are each independently an amino acid selected from D, E, G, K, N, P, Q, R, S, or T; X 14 , X 17 , X 22 , X 24 , and X 26 are each independently an amino acid selected from A, F, I, L, M, P or V; X 16 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; X 19 is an amino acid selected from A, F, I, L, M, P,
  • De novo proteins include those above wherein phenylalanine is removed from the list of amino acids for X 14 and X 17 , tyrosine is included in the list of amino acids for X 18 , glutamic acid and proline are removed from the list of amino acids for X 19 , proline is removed from the list of amino acids for X 20 , X 22 , X 23 , X 24 and X 25 , methionine is included in the list of amino acids for X 21 , histidine is included in the list of amino acids for X 23, methionine and proline are removed from the list of amino acids for X 26 , and isoleucine is included in the list of amino acids for X 27 and lysine and proline are removed.
  • H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5) wherein X 15 , X 18 , X 21 , X 23 , X 25 , and X 27 are each independently an amino acid selected from D, E, G, K, N, P, Q, R, S, or T; X 14 , X 17 , X 19 , X 22 , X 24 , and X 26 are each independently an amino acid selected from A, F, I, L, M, P or V; X 16 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably from A, F, I, L, M, P, E,
  • De novo proteins include those above wherein phenylalanine is removed from the list of amino acids for X 14 and X 17 , tyrosine is included in the list of amino acids for X 18 , glutamic acid and proline are removed from the list of amino acids for X 19 , proline is removed from the list of amino acids for X 20 , X 22 , X 23 , X 24 and X 25 , methionine is included in the list of amino acids for X 21 , histidine is included in the list of amino acids for X 23, methionine and proline are removed from the list of amino acids for X 26 , and isoleucine is included in the list of amino acids for X 27 and lysine and proline are removed..
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5) wherein X 14 is A or V X 16 is A or E (or X 16 is A or E or N) X 20 is K or Q X 15 , X 18 , X 21 , X 23 , X 25 , and X 27 are each independently an amino acid selected from D, E, G, K, N, P, Q, R, S, or T; and X 17 , X 19 , X 22 , X 24 , and X 26 are each independently an amino acid selected from A, F, I, L, M, P or V.
  • De novo proteins of the present invention include those above wherein for X 17 , tyrosine is included in the list of amino acids for X 18 , proline is removed from the list of amino acids for X 19 , proline is removed from the list of amino acids for X 22 , X 23 , X 24 and X 25 , methionine is included in the list of amino acids for X 21 , histidine is included in the list of amino acids for X 23, methionine and proline are removed from the list of amino acids for X 26 , andisoleucine is included in the list of amino acids for X 27 and lysine and proline are removed.
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5) wherein: X 14 is A or V; X 15 is E; X 16 is A or E; X 17 is A; X 18 is R; X 19 is A; X 20 is K or Q; X 21 is E; X 22 is A; X 23 is E X 24 is A; X 25 is K; X 26 is A; and X 27 is D.
  • H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5) wherein:
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5) wherein X 14 , X 17 , X 19 , X 22 , X 24 , and X 26 are each independently an amino acid selected from A, F, I, L, M, P or V; X 15 E; X 16 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably from A, F, I, L, M, P, E, or V); X 18 is R X 20 is an amino acid selected from A, D, E, F, G, I, K, L, M, M,
  • De novo proteins of the present invention include those above wherein phenylalanine is removed from the list of amino acids for X 14 and X 17 , proline is removed from the list of amino acids for X 20 , X 22 , and X 24 , and methionine and proline are removed from the list of amino acids for X 26 .
  • X 14 is an amino acid selected from A or V
  • X 16 is an amino acid selected from A or E
  • X 20 is an amino acid selected from K or Q.
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5) wherein X 14 is an amino acid selected from A or V X 16 is an amino acid selected from A or E X 20 is an amino acid selected from K or Q X 15 , X 17 , X 18 , X 19 , X 21 , X 22 , X 23 , X 24 , X 26 X 25 , X 26 and X 27 can be as provided in any of the embodiments herein.
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H2 comprises the amino acid sequence: NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MY (SEQ ID NO:5) wherein (i) X 15 is E and X 14 , X 16 , X 17 , X 18 , X 19 , X 20 , X 21 , X 22 , X 23 , X 24 , X 25, X 26 , and X 27 are as provided in the any of the embodiments herein for H2; (ii) X 18 is R and X 14 , X 15 , X 16 , X 17 , X 19 , X 20 , X 21 , X 22 , X 23 , X 24 , X 25 ,
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H2 comprises the amino acid sequence: X 47 X 4 8X 49 X50X51NX 14 X 15 NX 16 X 17 X 18 KX 19 X 20 X 21 FX 22 X 23 EQX 24 X 25 LX 26 X 27 MYX 52 X53X54X 55 X56 (SEQ ID NO:20) wherein X 14 - X 27 are as provided in the any of the embodiments herein for H2; and X 47 , X 48, X 49, X 50, X 51, X 52, X 53, X 54, X 55, and X 56 are are each independently selected from any amino acid.
  • X 14 - X 27 are as provided in the any of the embodiments herein for H2; and X 49 , X 52 , and X 55 are each independently an amino acid selected from A, F, I, L, M, P or V; X50 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; X 54 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably A, F, I, L, M, P or V); and X 47 , X 48, X 51, X 53, and X 56 are each independently an amino acid selected from D, E, G, K, N, P, Q, R, S, or T.
  • De novo proteins of the present invention include those above wherein cysteine and glutamine are included in the list of amino acids for X 47 and lysine is removed, phenylalanine and tyrosine are included in the list of amino acids for X 48 , histidine is included in the list of amino acids for X 52 and X 54 , and aspartic acid is removed from the list of amino acids for X 56 .
  • ACE2 protein decoys wherein X 14 - X 27 are as provided in the any of the embodiments herein for H2; X 49 , X 52 , and X 55 are each independently an amino acid selected from A, F, I, L, M, P or V; X 50 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; X 54 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably A, F, I, L, M, P or V); and X 48, X 51, X 53, and X 56 are each independently an amino acid selected from D, E, G, K, N, P, Q, R, S, or T; and X 47 is G.
  • De novo proteins of the present invention include those above wherein cysteine and glutamine are included in the list of amino acids for X 47 and lysine is removed, phenylalanine and tyrosine are included in the list of amino acids for X 48 , histidine is included in the list of amino acids for X 52 and X 54 , and aspartic acid is removed from the list of amino acids for X 56.
  • ACE2 protein decoys wherein wherein X 14 - X 27 are as provided in the any of the embodiments herein for H2; X 49 , X 52 , and X 55 are each independently an amino acid selected from A, F, I, L, M, P or V; X50 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; X 54 is L; and X 48, X 51, X 53, and X 56 are each independently an amino acid selected from D, E, G, K, N, P, Q, R, S, or T; and X 47 is G.
  • De novo proteins of the present invention include those above wherein phenylalanine and tyrosine are included in the list of amino acids for X 48 , histidine is included in the list of amino acids for X 52 , and aspartic acid is removed from the list of amino acids for X 56 .
  • X 47 is G or E; X 48 is D; X 49 is A; X 50 is A; and X 51 is R. In some such aspects, X 47 is G. In other such aspects, X 47 is E.
  • X 52 is A; X53 is E; X54 is L or F or N; X 55 is A; and X56 is K. In some such aspects, X54 is L.
  • X54 is F. In yet other aspects, X54 is N.
  • ACE2 protein decoys wherein one or more (preferably not more than 1 to 3) of the amino acids at positions 1, 4, 8, 12, 15, 16, 19, 22 and 23 of SEQ ID NO:5 are substituted and X 14 , X 15 , X 16 , X 17 , X 18 , X 19 , X 20 , X 21 , X 22 , X 23 , X 24 , X 25 , X 26 , X 27 are as described in any of the embodiments provided herein. For example, in some aspects, the following substitution is made E15G.
  • De novo proteins of the present invention include those wherein X 4 is not an amino acid selected from K or R; X 22 is not an amino acid selected from E, K, Q, or R; X 23 is not an amino acid selected from P; X 24 is not an amino acid selected from D or P; X 26 is not an amino acid selected from D, E, H, K, P, Q, or R; X 9 is not K, P, or R; and X 27 is not P.
  • De novo proteins of the present invention include those wherein H2 comprises an amino acid sequence having at least 70%, 80%, 90%, 95% or 100% identity to an amino acid sequence set forth in: NAENAARKAKEFAEEQAKLADMY (SEQ ID NO:21) NVENEARKAQEFAEEQAKLADMY (SEQ ID NO:22)
  • ACE2 protein decoys of the present invention include those wherein H2 comprises an amino acid sequence having at least 70%, 80%, 90%, 95% or 100% identity to an amino acid sequence set forth in SEQ ID NO:21 wherein the amino acid at position 1 is N or if substituted is A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W, or Y; wherein the amino acid at position 2 is A or if substituted is C, D, G, H, I, L, M, N, P, R, S, T, V, or W; wherein the amino acid at position 3 is E or
  • H2 comprises additional amino acids at the N terminus (e.g., at least 5 additional amino acids at the N terminus).
  • the amino acid sequence EDAAR (SEQ ID NO:23) or GDAAR (SEQ ID NO:24) is present at the N terminus.
  • H1 comprises a sequence having at least 70%, 80%, 90%, 95% or 100% identity to the sequence GDAAR NAENAARKAKEFAEEQAKLADMY AELAK (SEQ ID NO:244).
  • H2 comprises additional amino acids at the C terminus (e.g., at least 5 additional amino acids at the C terminus).
  • the amino acid sequence AELAK (SEQ ID NO:25) or AEFAK (SEQ ID NO:26) or AENAK (SEQ ID NO:27) is present at the C terminus.
  • the amino acid sequence AELAK (SEQ ID NO:25) is present at the C terminus and the amino acid sequence GDAAR (SEQ ID NO:24) is present at the N terminus.
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H3, if present, comprises the amino acid sequence: X 28 X 29 X 30 X 31 X 32 X 33 KGDX 34 RX 35 X 36 (SEQ ID NO:6); wherein X 28 X 29 X 30 X 31 X 32 X 33 , X 34 , X 35 , and X 36 , are each independently selected from any amino acid.
  • H3 comprises the amino acid sequence: X 28 X 29 X 30 X 31 X 32 X 33 KGDX 34 RX 35 X 36 (SEQ ID NO:6); wherein: X 28 is an amino acid selected from A, C, D, E, G, I, L, M, P, Q, R, S, T, V, or W (preferably A, G, I, L, M, Q, T, V, or W); X 29 is an amino acid selected from A, C, D, E, G, L, M, P, R, S, T, V, or W (preferably E, L, M, P, S, T, V, or W); X 30 is an amino acid selected from C, F, I, L, M, T, V, or W (preferably I, F, or V); X 31 is an amino acid selected from A, C, D, E, G, I, K, L, M, N, S, T, or V (preferably
  • H3 comprises the amino acid sequence: X 28 X 29 X 30 X 31 X 32 X 33 KGDX 34 RX 35 X 36 (SEQ ID NO:6); wherein X 31 is an amino acid selected from D, E, G, K, N, P, Q, R, S, or T; X 35 is an amino acid selected from D, E, G, K, N, P, Q, R, S, V, or T X 29 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably D, E, G, K, N, P, Q, R, S, T, or V) X 33 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably D, E, G, K, N, P, Q, R, S, T, or V) X 33 is an
  • De novo proteins of the present invention include those above wherein glutamine is included in the list of amino acids for X 28, alanine and proline are removed from the list of amino acids for X 30 , proline, glutamine, and arginine are removed from the list of amino acids for X 31 , alanine and proline are removed from the list of amino acids for X 32 , X 33 is an amino acid selected from D, G, or L; and aspartic acid and proline are removed from the list of amino acids for X 34 .
  • H3 comprises the amino acid sequence: X 28 X 29 X 30 X 31 X 32 X 33 KGDX 34 RX 35 X 36 (SEQ ID NO:6); wherein X 31 and X 35 are each independently an amino acid selected from D, E, G, K, N, P, Q, R, S, or T;
  • X 29 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably D, E, G, K, N, P, Q, R, S, T, or V)
  • X 33 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y (preferably G, A, F, I, L, M, P or V );
  • De novo proteins of the present invention include those above wherein glutamine is included in the list of amino acids for X 28, alanine and proline are removed from the list of amino acids for X 30 , proline, glutamine, and arginine are removed from the list of amino acids for X 31 , alanine and proline are removed from the list of amino acids for X 32 , X 33 is an amino acid selected from D, G, or L; and aspartic acid and proline are removed from the list of amino acids for X 34 .
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H3, if present, comprises the amino acid sequence: X 28 X 29 X 30 X 31 X 32 X 33 KGDX 34 RX 35 X 36 (SEQ ID NO:6); wherein X 35 is an amino acid selected from D, E, G, K, N, P, Q, R, S, or T; X 28 is A or V; X 29 is E or V; X 31 is D X 32 is M or L; X 33 is G; X 30 , and X 36 are each independently an amino acid selected from A, F, I, L, M, P or V; and X 34 is an amino acid selected from F, Y, K or C.
  • De novo proteins of the present invention include those above wherein alanine and proline are removed from the list of amino acids for X 30 .
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H3, if present, comprises the amino acid sequence: X 28 X 29 X 30 X 31 X 32 X 33 KGDX 34 RX 35 X 36 (SEQ ID NO:6); wherein X 28 is A or V; X 29 is E or V; X 30 is I; X 31 is D X 32 is M or L; X 33 is G; X 34 is an amino acid selected from F, Y, K or C; X 35 is E; and X 36 is I.
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H3, if present, comprises the amino acid sequence: X 28 X 29 X 30 X 31 X 32 X 33 KGDX 34 RX 35 X 36 (SEQ ID NO:6); wherein X 28 is A or V or T; X 29 is E or V; X 30 is I; X 31 is D X 32 is M or L or I; X 33 is G; X 34 is an amino acid selected from F, Y, K, I, or C; X 35 is E or V; and X 36 is I.
  • the de novo proteins of the present invention can comprise two alpha helical domains, H1 and H2, and an optional beta hairpin domain H3, wherein H3, if present, comprises the amino acid sequence: X 57 X 28 X 29 X 30 X 31 X 32 X 33 KGDX 34 RX 35 X 36 X 58 ; (SEQ ID NO:28) wherein X 28 -X 36 are as provided in the any of the embodiments herein for H3; X 57 is an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; and X 58 is an amino acid selected from D, E, G, K, N, P, Q, R, S, C or T.
  • De novo proteins of the present invention include those above wherein cysteine is included in the list of amino acids for X 28 and the amino aicds D, K, M, N, P, Q, and Y are removed from the list of amino acids for X 28 , and the amino acids A, C, F, H, I, L, M, V, or W are included in the list of amino acids for X 58.
  • ACE2 protein decoys wherein X 28 , X 29 , X 30 , X 31 , X 32 , X 33 , X 34 , X 35 , and X 36 , are as described in any of the embodiments herein and the amino acids at position 7 and/or position 8 of SEQ ID NO:6 are substituted.
  • one or more of the following substitutions are made: K7M or G8R.
  • Proteins of the present invention include those wherein H3 comprises an amino acid sequence having at least 70%, 80%, 90%, 95% or 100% identity to an amino acid sequence set forth in SEQ ID NOS: 29-34 or 200.
  • AEIDLGKGDFREI (SEQ ID NO:29) AEIDLGKGDCREI (SEQ ID NO:30) VVIDLGKGDFREI (SEQ ID NO:31) VVIDLGKGDCREI (SEQ ID NO:32) AEIDMGKGDCREI (SEQ ID NO:33) AEIDMGKGDFREI (SEQ ID NO:34) VEIDLGKGDFREI (SEQ ID NO: 200).
  • ACE2 protein decoys of the present invention include those wherein H3 comprises an amino acid sequence having at least 70%, 80%, 90%, 95% or 100% identity to an amino acid sequence set forth in SEQ ID NO:29 wherein the amino acid at position 1 is A or if susbstituted is C, D, E, G, I, L, M, P, Q, R, S, T, V, or W; wherein the amino acid at position 2 is E or if susbstituted is A, C, D, G, L, M, P, R, S, T, V, or W; wherein the amino acid at position 3 is I or if substituted is C, F, L, M, T, V, or W; wherein the amino acid at position 4 is D or if substituted is A, C, E, G, I, K, L, M, N, S, T, or V; wherein the amino acid at position 5 is L or if substituted is F, I, M, or V; wherein the amino acid at position 6
  • ACE-2 protein decoys wherein no more 3, 2, or 1 of the amino acids at positions 7, 8, 9, and 11 are substituted, wherein numbering is in accordance with SEQ ID NO: 29.
  • ACE2 protein decoys comprising a H3 domain wherein the amino acid at position X 34 is not cysteine.
  • ACE2 protein decoys comprising a H3 domain wherein if X 34 is cysteine, X 32 is leucine.
  • H3 comprises at least one additional amino acid at the N terminus, preferably an amino acid that doesn’t negatively impact binding to the coronavirus spike protein.
  • the amino acid is selected from D, E, G, K, N, P, Q, R, S, Y, or T.
  • the amino acid is selected from A, C, E, F, G, I, L, R, S, T, V, or W.
  • the amino acid is selected from S, P, T, or Y or from L, S, P, T, or Y.
  • the amino acid is S or P (preferably S). Accordingly, in some aspects H3 comprises a sequence having at least 70%, 80%, 90%, 95% or 100% identity to the sequence SAEIDLGKGDFREIR (SEQ ID NO: 245) or SVEIDLGKGDFREIR (SEQ ID NO:246) [00120] In some aspects H3 comprises at least one additional amino acid at the C terminus, preferably an amino acid that doesn’t negatively impact binding to the coronavirus spike protein. In some aspects, the amino acid is selected from L, D, E, G, K, N, P, Q, R, S, or T or A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V or W .
  • the amino acid is R.
  • De novo proteins of the present invention comprise at least one structural domain that facilitates protein folding and binding-competent presentation of the alpha helices and beta hairpin domains to the coronavirus spike protein.
  • Preferred structural domains provide the de novo proteins with a hydrophobic core and serve to stabilize the relative position and orientation of the binding motifs in a manner that is competent for binding.
  • the supporting structures can be computationally generated and placed by an available method (e.g.
  • Rosetta fragment assembly, parametric generation, and the like or extracted from existing structures (see, e.g., examples herein).
  • these structural domains do not substantially map to the structural domains of the ACE2 protein (i.e., do not structurally or sequentially align to other secondary structure elements in ACE2).
  • a skilled practitioner could use protein design principles to create structural domains for use in the present invention.
  • Structural domains that facilitate protein folding and binding-competent presentation of H1, H2, and H3, if present, can comprise an amino acid sequence set forth below for D1 and D2: D1 – X A X A X B X B X C X B X B X B X A X B X B X A X C X B X C X A X B X C X A X A X B X C X A (SEQ ID NO:35).
  • each X A is independently an amino acid selected from D, E, G, K, N, P, Q, R, S, C, and T
  • each X B is independently an amino acid selected from A, F, I, L, M, C, and P (preferably A, F, I, L, M, and P)
  • each X C is independently an amino acid selected from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, Y or C
  • each X D is independently an amino acid from A, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W, Y or C.
  • X D is an amino acid selected from A, F, I, L, M, P, or V. In other embodiments X D is an amino acid selected from D, E, G, K, N, P, Q, R, S, T, or C. [00124] In some embodiments, D1 comprises the amino acid sequence set forth below: X A X A AAX A ALAX A A AX A AMKX A ALX A I IX A X A IAX A X A (SEQ ID NO:37) ; wherein each X A is independently an amino acid selected from D, E, G, K, N, P, Q, R, S, C, or T.
  • D1 comprises an amino acid sequence at least 50%, 60%, 70%, 80%, 85%, 90%, 95% or 100% identical to the amino acid sequence set forth below: REAAEALAEAARAMKEALEIIREIAEK (SEQ ID NO:38) REAAEALAEAARAMKEALEILREIAEK (SEQ ID NO:222).
  • ACE2 protein decoys of the present invention include those wherein at least one structural domain (e.g., D1) comprises an amino acid sequence having at least 50%, 60%, 70%, 80%, 85%, 90%, 95% or 100% identity to an amino acid sequence set forth in SEQ ID NO:38 wherein the amino acid at position 1 is R or if substituted is A, C, E, F, G, I, K, L, M, P, S, T, V, or W (preferably C, E, F, G, K, L, M, P, S, T, or W ); wherein the amino acid at position 2 is E or if substituted is A, C, D, F, G, I, K, L, M, P, Q, R, S, T, V, W, or Y (preferably A, G, K, M, V, W, or Y); wherein the amino acid at position 3 is A or if substituted is C, E, G, K, L, M, P, Q, R, S, T, V, W, or Y
  • D2 comprises an amino acid sequence set forth in SEQ ID NO: 39 or 40: XAAXAXAAAXAXA IAXAAIXAXAAAXA AIAXAAAXAIAA XAA (SEQ ID NO:39) XAAXAXAAAXA VAXAAIXAXAAAXA AIVXAAAXAIAA XAA (SEQ ID NO:40); wherein each X A is independently an amino acid selected from D, E, G, K, N, P, Q, R, S, C, or T.
  • D2 comprises the amino acid sequence set forth below: RASEAAKRX 59 AX 60 AIRKAAD AIX 61 X 62 AAKIAA RA (SEQ ID NO:41), wherein X 59 is I or V, X 60 is K or R or C, X 61 is A or V or C, X 62 is E or C.
  • D2 comprises an amino acid sequence at least 50%, 60%, 70%, 80%, 85%, 90%, 95% or 100% identical to an amino acid sequence set forth in SEQ ID NO:42-46: RASEAAKR IAKAIRKAAD AIAEAAKIAA RA (SEQ ID NO:42); RASEAAKR IACAIRKAAD AIAEAAKIAA RA (SEQ ID NO:43); RASEAAKR IAKAIRKAAD AIACAAKIAA RA (SEQ ID NO:44); RASEAAKR VARAIRKAAD AIVEAAKIAA RA (SEQ ID NO:45); RASEAAKR VACAIRKAAD AIVEAAKIAA RA (SEQ ID NO:46).
  • ACE2 protein decoys of the present invention include those wherein at least one structural domain (e.g., D2) comprises an amino acid sequence having at least 50%, 60%, 70%, 80%, 85%, 90%, 95% or 100% identity to an amino acid sequence set forth in SEQ ID NO:42 wherein the amino acid at position 1 is R or if susbstituted is A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W, or Y (preferably A, C, D, E, H, K, L, N, Q, S, or Y); wherein the amino acid at position 2 is A or if susbstituted is C, G, I, L, M, N, P, Q, S, T, V, or Y (preferably C, M, Q, T, or V); wherein the amino acid at position 3 is S or if substituted is A, C, D, E, F, G, H, I, K, L, M,
  • the de novo proteins of the present invention optionally comprise amino acid linkers between the domains.
  • the amino acid linkers may be of any length as deemed appropriate for an intended use.
  • the linkers can be, for example, from 1 to 100 amino acids in length, such as 1-100, 1-90, 1-80, 1-70, 1-60, 1-50, 1-40, 1-30, 1-20, 1-10, or 1-5 amino acids in length.
  • the flexibility in linker stems from the use of de novo protein design to construct the proteins of the present invention.
  • An exemplary ACE2 protein decoy of the present invention is CTC-445 and is as set forth in SEQ ID NO:47.
  • ACE2 protein decoy CTC-445 comprises a H1, H2, and H3 domain as well as two structural domains that facilitate protein folding and binding-competent presentation of H1, H2, and H3.
  • the order of the domains is H3-D1-D2-H2-H1.
  • the ACE2 protein decoy also includes four linkers linking together the various domains. The first linker is from amino acid 16-20, the second linker is from amino acid 48-52, the third linker is from amino acid 83-85 and the fourth linker is from amino acid 119-122. Numbering is according to SEQ ID NO:47.
  • ACE2 protein decoys that are variants of CTC-445.
  • exemplary variants are those that have introduced amino acid substitutions that play a role in optimizing the stability and/or folding of the protein. Typically, these substitutions are not at the binding interface but at other locations in the protein. Although these substitutions are not at the binding interface, they can lead to improved binding affinity, in addition to improved activity. Methods of testing proteins for improved binding, stability, and or protein folding are known in the art and are described herein.
  • CTC-445 has been demonstrated to specifically bind to the SARS-COV-2 spike protein but only weakly bind to the SARS- CoV-1 spike protein.
  • ACE2 protein decoys that are variants of CTC- 445 are capable of binding to both the SARS-COV-2 spike protein and the SARS-COV-1 spike protein with higher affinity as compared to CTC-445.
  • Exemplary ACE2 protein decoys having identity to CTC-445 include those set forth in SEQ ID NOS:48-68, 184-188, 104-172, and 224-239.
  • ACE2 protein decoys comprising a sequence at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to an amino acid sequence set forth in SEQ ID NOS:47-68, 184-188, 104-172, 224-239, 255-257 or 265.
  • the teachings provided herein with respect to the H1, H2, H3, D1 and D2 domains in addition to the examples provided herein and the skill in the art, can be used to make variants of such ACE2 protein decoys. Table 1
  • the amino acid residue at position 88 of any of SEQ ID NOS:47-68, 184-187, and 104-172, if not A, is selected from F, I, L, M, P or V.
  • the amino acid residue at position 137 of any of SEQ ID NOS: 47-68, if not F, is D, E, G, K, N, P, Q, R, S, or T.
  • the amino acid residue at position 11 of any of SEQ ID NOS:47-68, 184-187, and 104-172 is cysteine, the amino acid residue at position 6 is L and/or the amino acid residue at position 126 is L and/or the amino acid residue at position 124 is S.
  • the ACE2 protein decoy does not have the amino acid sequence of CTC- 625 or CTC-626. In some embodiments, the ACE2 protein decoy does not have the amino acid substitutions set forth in CTC-625 (i.e., S1P_F11Y_R42S_E46S_A88N_F116N_R124S_F137V_R143L) or CTC-626 (i.e., S1P_M6L_F11Y_R42S_E46S_A88N_F116N_R124S_F137V_R143L-F152W).
  • CTC-625 i.e., S1P_F11Y_R42S_E46S_A88N_F116N_R124S_F137V_R143L
  • CTC-626 i.e., S1P_M6L_F11Y_R42S_E46S_A88N_F116N_R124S_F137V_R143L-F152W
  • Exemplary de novo proteins of the present invention include those comprising a sequence at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to an amino acid sequence set forth in SEQ ID NOS:47-68, 184-187, or 104-172, wherein a. the amino acid residue at position 2 is V; or b. the amino acid residue at position 3 is V; or c. the amino acid residue at position 6 is L; or d. the amino acid residue at position 11 is F or C; or e. the amino acid residue at position 41 is L; or f. the amino acid residue at position 61 is V; or g.
  • the amino acid residue at position 63 is R or C; or h. the amino acid residue at position 73 is V; or i. the amino acid residue at position 74 is E or C; or j. the amino acid residue at position 84 is T; or k. the amino acid residue at position 86 is G; or l. the amino acid residue at position 92 is V; or m. the amino acid residue at position 95 is E; or n. the amino acid residue at position 100 is Q; or o. the amino acid residue at position 116 is L; or p. the amino acid residue at position 124 is S; or q. the amino acid residue at position 126 is L; or r. the amino acid residue at position 136 is T; or s.
  • Exemplary de novo proteins of the present invention include those comprising a sequence at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to an amino acid sequence set forth in SEQ ID NOS:47-68, 184-187, 104-172, 255-256, or 224-239 wherein no more than 1 of the amino acids at positions 8, 9, or 10 is substituted; no more than 3, no more than 2 or no more than 1 of the amino acids at positions 91, 94, 98, 102, 105, 106, 109, 112, or 113 is substituted; no more than 4, no more than 3, and/or no more than 2 or no more than 1 of the amino acids at positions 123, 127, 128, 131, 132, 134, 138, 139, 141, 142
  • Exemplary de novo proteins of the present invention include those comprising a sequence at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to an amino acid sequence set forth in SEQ ID NOS:47-68, 184-187, 104-172, 255-256, or 224-239 wherein no more than 4, 3, 2, or 1 of the amino acids at positions 8, 9, 10, 91, 94, 98, 102, 105, 106, 109, 112, 113123, 127, 128, 131, 132, 134, 138, 139, 141, 142, 145, 146, or 149 is substituted.
  • Exemplary de novo proteins of the present invention include those comprising a sequence at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to an amino acid sequence set forth in SEQ ID NO:47-68, 184-187, 104-172, 255-256, or 224-239, wherein a. the amino acid residue at position 8 is K; b. the amino acid residue at position 9 is G; c. the amino acid residue at position 10 is D d. the amino acid residue at position 91 is N e. the amino acid residue at position 94 is N f. the amino acid residue at position 98 is K g.
  • the amino acid residue at position 102 is F h.
  • the amino acid residue at position 105 is E i.
  • the amino acid residue at position 106 is Q j.
  • the amino acid residue at position 109 is L k.
  • the amino acid residue at position 112 is M l.
  • the amino acid residue at position 113 is Y m.
  • the amino acid residue at position 123 is S n.
  • the amino acid residue at position 127 is E o.
  • the amino acid residue at position 128 is Q p.
  • the amino acid residue at position 131 is T q.
  • the amino acid residue at position 132 is F r.
  • the amino acid residue at position 134 is D s.
  • the amino acid residue at position 138 is H t.
  • the amino acid residue at position 139 is E u.
  • the amino acid residue at position 141 is E v. the amino acid residue at position 142 is D w. the amino acid residue at position 145 is Y x. the amino acid residue at position 146 is Q; and y. the amino acid residue at position 149 is L [00138] As taught herein, a great deal of variability can be present in the linkers of the exemplary protein decoys.
  • de novo proteins include those comprising a sequence at least 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to an amino acid sequence set forth in SEQ ID NOS: 69-90, 258-260, 266, or 189-193 wherein X L is an amino acid linker.
  • X L is an amino acid linker.
  • the first and second X L linker in each de novo protein is from 0-5 amino acids
  • the third X L linker in each de novo protein is from 0-3 amino acids
  • the fourth X L linker in each de novo protein is from 0-4 amino acids.
  • the amino acid residue at position 88 of any of SEQ ID NOS: 69-90, 258-259, or 189-192, if not A, is selected from F, I, L, M, P or V; wherein position 88 is in reference to SEQ ID NO:47 with fixed linker lengths.
  • the amino acid residue at position 137 of any of SEQ ID NOS: 69-90 or 189- 192, if not F, is D, E, G, K, N, P, Q, R, S, or T.
  • the amino acid residue at position 11 of any of SEQ ID NOS: 69-90, 258-259, or 189-192 is cysteine
  • the amino acid residue at position 6 is L and/or the amino acid residue at position 126 is L and/or the amino acid residue at position 124 is S.
  • Positions 88, 137, 11, 124, and 126 referred to above mean the positions in SEQ ID NOs: 69-90, 258-259, or 189-192 that correspond to positions 88, 137, 11, 124, and 126, respectively, in SEQ ID NO:47, and not the actual positions in SEQ ID NOS: 69-90 or 189-193, which may vary due to the non- fixed length of the linkers X L .
  • position 88 of any one of SEQ ID NOs: 69-90 or 189-192 means the position in any one of SEQ ID NOs: 69-90, 258-259, or 189- 193 corresponding to position 88 in SEQ ID NO: 47.
  • Exemplary de novo proteins of the present invention include those comprising a sequence at least 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to the amino acid sequence set forth in SEQ ID NO: 69-90, 258-259, or 189-192 wherein a. the amino acid residue at position 2 is V; or b.
  • the amino acid residue at position 3 is V; or c. the amino acid residue at position 6 is L; or d. the amino acid residue at position 11 is F or C; or e. the amino acid residue at position 41 is L; or f. the amino acid residue at position 61 is V; or g. the amino acid residue at position 63 is R or C; or h. the amino acid residue at position 73 is V; or i. the amino acid residue at position 74 is E or C; or j. the amino acid residue at position 86 is G; or k. the amino acid residue at position 92 is V; or l. the amino acid residue at position 95 is E; or m. the amino acid residue at position 100 is Q; or n.
  • the amino acid residue at position 116 is L ; or o. the amino acid residue at position 124 is S; or p. the amino acid residue at position 126 is L; or q. the amino acid residue at position 136 is T; or r. the amino acid residue at position 143 is S or L; or any combination of (a) – (r) – above, wherein the noted positions are according to the numbering of SEQ ID NO:47 not of SEQ ID NOS: 69-90 or 189-192 due to the non-fixed length of the linkers X L , as discussed above.
  • Exemplary de novo proteins of the present invention include those comprising a sequence at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to the amino acid sequence set forth in SEQ ID NO: 69-90, 258-259, or 189-192 wherein no more than 1 of the amino acids at positions 8, 9, or 10 is substituted; no more than 3, no more than 2 or no more than 1 of the amino acids at positions 91, 94, 98, 102, 105, 106, 109, 112, or 113 is substituted; no more than 3, no more than 2 or no more than 1 of the amino acids at positions 123, 127, 128, 131, 132, 134, 138, 139, 141, 142, 145, 146, or 149 is substituted.
  • Exemplary de novo proteins of the present invention include those comprising a sequence at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to the amino acid sequence set forth in SEQ ID NO: 69-90, 258-259, or 189-192 wherein no more than 4, 3, 2, or 1 of the amino acids at positions 8, 9, 10, 91, 94, 98, 102, 105, 106, 109, 112, 113123, 127, 128, 131, 132, 134, 138, 139, 141, 142, 145, 146, or 149 is substituted.
  • Exemplary de novo proteins of the present invention include those comprising a sequence at least 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to the amino acid sequence set forth in SEQ ID NO: 69-90, 258-259, or 189-192 wherein: a. the amino acid residue at position 8 is K; b. the amino acid residue at position 9 is G; c. the amino acid residue at position 10 is D d.
  • the amino acid residue at position 91 is N e. the amino acid residue at position 94 is N f. the amino acid residue at position 98 is K g. the amino acid residue at position 102 is F h. the amino acid residue at position 105 is E i. the amino acid residue at position 106 is Q j. the amino acid residue at position 109 is L k. the amino acid residue at position 112 is M l. the amino acid residue at position 113 is Y m. the amino acid residue at position 123 is S n. the amino acid residue at position 127 is E o. the amino acid residue at position 128 is Q p. the amino acid residue at position 131 is T q. the amino acid residue at position 132 is F r.
  • the amino acid residue at position 134 is D s.
  • the amino acid residue at position 138 is H t.
  • the amino acid residue at position 139 is E u.
  • the amino acid residue at position 141 is E v.
  • the amino acid residue at position 142 is D w.
  • the amino acid residue at position 145 is Y x.
  • the amino acid residue at position 146 is Q; and y. the amino acid residue at position 149 is L; wherein the noted positions are according to the numbering of SEQ ID NO:47 not of SEQ ID NOS: 69-90 or 189-192 due to the non-fixed length of the linkers X L , as discussed above.
  • ACE2 protein decoys comprising a sequence at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to the amino acid sequence set forth in SEQ ID NO:47 provided that the following substitutions are present (i) M6L_R126L; (ii) F116L_R124S; (iii) M6L_F116L_R124S_R126L; (iv) M6L_E86G_F116L_R124S_R126L; (v) A2V_E3V_M6L_I61V_K63R_A73V_K84T_E86G_A92V_A95E_ K100Q_F116L_R124S_R126L _A136T_R143S; (vi) M6L_F11C_R126L; (vii) M6L_K63C_R126L; (i) M6L_K63C_
  • the ACE2 protein decoy comprises a sequence at least 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identical to the amino acid sequence for (i) CTC 637: the following amino acids are present: 6L and 126L; (ii) CTC-638; the following amino acids are present: 116L and 124S; (iii) CTC-639; the following amino acids are present: 6L, 116L, 124S, and 126L (iv) CTC-640; the following amino acids are present: 6L, 86G, 116L, 124S, and 126L (v) CTC-641; the following amino acids are present: 2V, 3V, 6L, 61V, 63R, 73V, 84T, 86G, 92V, 95E, 100Q, 116L, 124S, 126L , 136
  • de novo proteins of the present invention comprising a decoy unit comprises an amino acid sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to an amino acid sequence set forth in SEQ ID NO: 50 (CTC-640) wherein wherein the amino acid at position 1 is S or if substituted is A, C, E, F, G, I, L, R, S, T, V, or W (preferably C, E, G, I, or L); wherein the amino acid at position 2 is A or if substituted is C, D, E, G, I, L, M, P, Q, R, S, T, V, or W (preferably G, I, L, M, Q, T, V, or W); wherein the amino acid at position 3 is E or if substituted is A, C, D, G, L, L
  • not more than 6, 5, 4, 3, 2 or 1 of positions 8, 9, 10, 12, 91, 94, 98, 102, 105, 106, 109, 112, 113, 123, 127, 128, 131, 132, 134, 135, 138, 139, 141, 142, 145, 146, and 140 are substituted.
  • amino acid residues are added at the N terminus of the protein decoys to add stability.
  • a PG sequence i.e., proline-glycine
  • the ACE2 protein decoys may comprise one, two, three, four, or more decoy units. In some embodiments, the ACE2 protein decoys may comprise one, two, three, or four decoy units.
  • a decoy unit comprises (i) at least two alpha helical domains, H1 and H2, (ii) an optional beta hairpin domain, H3, and (iii) at least one structural domain.
  • a decoy unit comprises (i) two alpha helical domains, H1 and H2, (ii) one beta hairpin domain, H3, and (iii) two structural domains.
  • the H1, H2, H3 and structural domains are as described herein.
  • Nonlimiting exemplary decoy units are provided in SEQ ID NOS: 47-90, 104-172, 184-193, 224-239, and 255-260.
  • an ACE2 protein decoy is multivalent (e.g. bivalent, trivalent, tetravelent), which means it comprises at least two decoy units.
  • the ACE2 protein decoy comprises two, three, four, or more amino acid sequences independently selected from SEQ ID NOS: 47-90, 104-172, 184-193, 224-239, 255-260, or 265-266.
  • the ACE2 protein decoy comprises multiple copies of the same decoy unit.
  • the ACE2 protein decoy comprises 2 to 4 copies of the same or a different decoy unit.
  • a the C terminus of a first decoy unit is linked to the N terminus of a second decoy unit.
  • Linkage can be via a chemical or enzymatic crosslinking (e.g. bismaleimide).
  • the two or more decoy units may directly abut each other in the translational fusion or may be linked by a polypeptide linker suitable for the intended purpose.
  • Exemplary such linkers include, but are not limited, to those disclosed in WO2016178905, WO2018153865, and WO 2018170179.
  • suitable linkers include, but are not limited to peptide linkers, such as, for example, GGGGG (SEQ ID NO: 96), GSGGG (SEQ ID NO: 97), GGGGGG (SEQ ID NO: 98), GGSGGG (SEQ ID NO: 99), GGSGGSGGGSGGSGSG (SEQ ID NO: 100), GSGGSGGGSGGSGSG (SEQ ID NO: 101), GGSGGSGGGSGGSGGGGSGGSGGGGS (SEQ ID NO: 102), GGGGSGGSGSGGSGGGS (SEQ ID NO: 175), [GGGGX]n (SEQ ID NO: 103), where X is Q, E or S and n is 2-5, and GGGSGGSGSGGSGGGS (SEQ ID NO: 264).
  • peptide linkers such as, for example, GGGGG (SEQ ID NO: 96), GSGGG (SEQ ID NO: 97), GGGGGG (SEQ ID NO: 98), GGSGGG (SEQ ID NO
  • an amino acid linker between decoy units is from 1-100, from 1-80, from 1- 60, from 1-50, from 1-40, from 1-30, or from 1-20 amino acids in length.
  • two or more ACE2 protein decoys are different. Any of the ACE2 protein decoys provided herein can be linked together for use in the present invention.
  • Exemplary ACE2 protein decoys include those comprising one or more of CTC- 640, CTC-693, CTC-694, CTC-695, CTC-702, CTC-705, or CTC-726.
  • an ACE2 protein decoy comprises (i) CTC-640 and/or CTC-693; (ii) CTC-640 and/or CTC- 694; (iii) CTC-640 and/or CTC-694.
  • an ACE2 protein decoy of the present comprises a multivalent, serially duplicated version of CTC-640; a multivalent ACE2 protein decoy wherein each individual decoy unit comprises a sequence having at least 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, or 99% identity to the amino acid sequence set forth for CTC-640; a multivalent, serially duplicated version of CTC-702; a multivalent ACE2 protein decoy wherein each individual decoy unit comprises a sequence having at least 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, or 99% identity to the amino acid sequence set forth for CTC-
  • multivalent ACE2 protein decoy e.g, bivalent, trivalent, tetravalent, etc.
  • the multivalent ACE2 protein decoys can be cyclized. Methods for cyclizing proteins are known in the art, see, for example, Wood et al., Journal of Biological Chemistry, 289:21; 14512-14519, 2014.
  • Exemplary multivalent ACE2 protein decoys include those comprising a sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to an amino acid sequence set forth in SEQ ID NOS: 91-95, 194-197, 223, or 261-263: Table 3
  • DOMAIN ORDER [00154] As a result of the proteins of the present invention being de novo synthesized proteins, there is a great deal of variability permitted both in the amino acid residues but also in the ordering of the domains (e.g. H1, H2, H3, D1 and D2).
  • the domains can be linked together via amino acid linkers in varying order and still be properly folded and presented for binding to the coronavirus spike protein.
  • the order of domains in CTC-445 and many of the CTC-445 variants provided herein is H3-D1-D2-H2-H1. The skilled artisan will understand, however, that the domains can be re-ordered and still result in active proteins.
  • re-ordering the domains is referred to as circular permutation and the amino acid sequence of a decoy unit is shifted in order to create a new N- and C-terminus.
  • re-ordering results in a new order of domains that is H1-H3-D1-D2-H2.
  • All of the CTC-445 variants described herein having an order of domains of H3-D1-D2-H2-H1 can be circular permutated to create a new domain order of H1-H3-D1-D2-H2. Due to the shifting of the H1 domain, an amino acid linker is added in between the H1 and H3 domains, creating a variable loop region between domains H1 and H3.
  • CTC-705 is a circular permutated version of CTC-640.
  • CTC-726 and CTC-786 are two circular permutated versions of CTC-702 that differ with respect to the linker between domains H1 and H3.
  • Circular permutating CTC-640 and CTC-726 in such a manner repositions the termini of the ACE2 protein decoy to allow for different orientation of the multivalent subunits. Additional ordering of domains includes, for example, H2-H1-H3- D1-D2, D2-H2-H1-H3-D1, and D1-D2-H2-H1-H3.
  • the proteins of the present invention function by binding to coronavirus spike protein, in particular, coronavirus spike protein from a coronavirus that gains entry into host cells via ACE2 as its receptor (i.e., coronavirus ACE2-binding spike protein).
  • the de novo proteins of the present invention bind to one or more amino acids in the receptor binding domain (RBD) of the spike protein SARS-COV-2.
  • the present invention includes de novo proteins that bind SARS-CoV-2 spike protein with a K d of less than 100 nM, less than 50 nm, less than 20 nM as measured by biolayer interferometry or yeast display, e.g., using the assay formats as defined in the examples.
  • the de novo proteins bind SARS-CoV-S with a Kd of less than about 20 nM, less than about 15 nM, less than about 5 nM as measured by biolayer interferometry or yeast display, e.g., using the assay formats as defined herein, or a substantially similar assay.
  • the present invention also includes de novo ACE2 protein decoys of the present invention that block more than 50%, more than 60%, more than 70% more than 80% or more than 90% of SARS-CoV-2-S binding to ACE2 as determined using assays known in the art.
  • the present invention also includes de novo proteins that neutralize or inhibit the infectivity of a coronavirus (e.g. SARS-CoV-2) for its host cells. In certain embodiments, the proteins neutralize the infectivity of SARS-CoV-2-like pseudoparticles.
  • the proteins inhibit more than 50%, more than 60%, more than 70% more than 80% or more than 90% binding of SARS-CoV-2 on human host cells in an optimized virus-like pseudo-particle (VLP) neutralization assay, e.g., as shown in the examples, or a substantially similar assay.
  • VLP virus-like pseudo-particle
  • the present invention includes de novo ACE2 protein decoys that bind to the receptor binding domain of SARS-CoV-2 spike protein or to a fragment of the domain.
  • the ACE2 protein decoys of the present invention may possess one or more of the aforementioned biological characteristics, or any combinations thereof.
  • the present invention provides nucleic acids, including isolated nucleic acids, encoding a protein or peptide of the present invention.
  • the isolated nucleic acid sequence may comprise RNA or DNA.
  • Such isolated nucleic acid sequences may comprise additional sequences useful for promoting expression and/or purification of the encoded protein, including but not limited to polyA sequences, modified Kozak sequences, and sequences encoding epitope tags, export signals, and secretory signals, nuclear localization signals, and plasma membrane localization signals. It will be apparent to those of skill in the art, based on the teachings herein, what nucleic acid sequences will encode the polypeptides of the invention. [00162] In another aspect, the present invention provides recombinant expression vectors comprising the isolated nucleic acid of any aspect of the invention operatively linked to a suitable control sequence.
  • Recombinant expression vector includes vectors that operatively link a nucleic acid coding region or gene to any control sequences capable of effecting expression of the gene product.
  • Control sequences operably linked to the nucleic acid sequences of the invention are nucleic acid sequences capable of effecting the expression of the nucleic acid molecules.
  • the control sequences need not be contiguous with the nucleic acid sequences, so long as they function to direct the expression thereof. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the nucleic acid sequences and the promoter sequence can still be considered “operably linked" to the coding sequence.
  • control sequences include, but are not limited to, polyadenylation signals, termination signals, and ribosome binding sites.
  • expression vectors include but are not limited to, plasmid and viral-based expression vectors.
  • the control sequence used to drive expression of the disclosed nucleic acid sequences in a mammalian system may be constitutive (driven by any of a variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive).
  • the expression vector must be replicable in the host organisms either as an episome or by integration into host chromosomal DNA.
  • the expression vector may comprise a plasmid, viral-based vector (including but not limited to a retroviral vector or oncolytic virus), or any other suitable expression vector.
  • the expression vector can be administered in the methods of the disclosure to express the proteins in vivo for therapeutic benefit.
  • the nucleic acids of the present invention may be administered to a subject to treat a disease described herein.
  • the present disclosure provides host cells that comprise the recombinant expression vectors disclosed herein, wherein the host cells can be either prokaryotic or eukaryotic.
  • the cells can be transiently or stably engineered to incorporate the expression vector of the invention, using techniques including but not limited to bacterial transformations, calcium phosphate co-precipitation, electroporation, or liposome mediated-, DEAE dextran mediated-, polycationic mediated-, or viral mediated transfection.
  • techniques including but not limited to bacterial transformations, calcium phosphate co-precipitation, electroporation, or liposome mediated-, DEAE dextran mediated-, polycationic mediated-, or viral mediated transfection.
  • a method of producing a protein according to the invention is an additional part of the invention.
  • the method comprises the steps of (a) culturing a host according to this aspect of the invention under conditions conducive to the expression of the protein, and (b) optionally, recovering the expressed protein.
  • the expressed protein can be recovered from the cell free extract, but preferably they are recovered from the culture medium.
  • the present disclosure provides antibodies that selectively bind to the proteins of the disclosure.
  • the antibodies can be polyclonal, monoclonal antibodies, humanized antibodies, and fragments thereof, and can be made using techniques known to those of skill in the art.
  • Exemplary proteins of the present invention can be prepared as fusion or chimeric polypeptides that include de novo ACE2 protein decoys of the present invention and a heterologous polypeptide.
  • Exemplary heterologous polypeptides can increase the circulating half-life of the resultant chimeric polypeptide in vivo, and may, therefore, further enhance the properties of the proteins of the present invention.
  • the polypeptide that increases the circulating half-life may be a serum albumin, such as human serum albumin, or the Fc region of the IgG subclass of antibodies that lacks the IgG heavy chain variable region.
  • exemplary Fc regions can include a mutation (e.g., a mutation that inhibits complement fixation and/or Fc receptor binding) or it may be lytic, i.e., able to bind complement or to lyse cells via another mechanism, such as antibody-dependent complement lysis.
  • the “Fc region” can be a naturally occurring or synthetic polypeptide that is homologous to the IgG C-terminal domain produced by digestion of IgG with papain.
  • the fusion proteins can include the entire Fc region, or a smaller portion that retains the ability to extend the circulating half-life of a chimeric polypeptide of which it is a part.
  • full-length or fragmented Fc regions can be wild-type or variants of the wild-type molecule. That is, they can contain mutations that may or may not affect the function of the polypeptides. For example, they may have effector function or may be modified as to have one or more activities associated with effector function reduced or completely eliminated. Effector function refers to those biological activities attributable to the Fc region of an immunoglobulin, which vary with the immunoglobulin isotype.
  • the de novo proteins of the present invention includes an IgG1, IgG2, IgG3, or IgG4 Fc region.
  • the de novo proteins include a variant IgG1, IgG2, IgG3, or IgG4 Fc region.
  • the variant Fc region lacks effector function.
  • a de novo protein of the present invention is fused to the C-terminus of an Fc region (e.g., a native or variant IgG1, IgG2, IgG3, or IgG4 Fc region).
  • an Fc region e.g., a native or variant IgG1, IgG2, IgG3, or IgG4 Fc region.
  • two de novo proteins of the present invention are fused to a Fc region (e.g., at the N and C terminus).
  • the proteins of the present invention may be linked to other types of stabilization compounds to promote an increased half-life in vivo, including but not limited to attachment of one or more polyethylene glycol chains (PEGylation).
  • the de novo proteins can have amino acid substitutions that enable chemical conjugation with water soluble polymers (e.g., PEG) that increase circulating half- life compared to the protein alone.
  • PEG water soluble polymers
  • a “PEG” is a poly(ethylene glycol) molecule which is a water-soluble polymer of ethylene glycol.
  • PEGs can be obtained in different sizes, and can also be obtained commercially in chemically activated forms that are derivatized with chemically reactive groups to enable covalent conjugation to proteins.
  • Linear PEGs are produced in various molecular weights, such as PEG polymers of weight-average molecular weights of 5,000 daltons, 10,000 daltons, 20,000 daltons, 30,000 daltons, and 40,000 daltons.
  • Branched PEG polymers have also been developed.
  • Commonly-used activated PEG polymers are those derivatized with N-hydroxysuccinimide groups (for conjugation to primary ambines such as lysine residues and protein N-termini), with aldehyde groups (for conjugation to N-termini), and with maleimide or iodoacetamide groups (for coupling to thiols such as cysteine residues).
  • Methods of designing moieties for conjugation to PEG are known in the art. For example, addition of polyethylene glycol (“PEG”) containing moieties may comprise attachment of a PEG group linked to maleimide group (e.g., "PEG-MAL”) to a cysteine residue of the protein.
  • PEG-MAL polyethylene glycol
  • PEG-MAL examples include, for example, methoxy PEG-MAL 5 kD; methoxy PEG-MAL 20 kD; methoxy (PEG)2-MAL 40 kD; methoxy PEG(MAL)25 kD; methoxy PEG(MAL)220 kD; methoxy PEG(MAL)240 kD; or any combination thereof.
  • an amino acid that is not necessary for binding can be replaced by cysteine to allow for attachment of a desirable moiety.
  • the ACE2 protein decoy comprises 0-4 cysteine amino acids, or in some embodiments, comprises 0 or 1 cysteine amino acids.
  • a water-soluble polymer such as a PEG molecule is linked to one or more (preferably one) of the cysteine residue.
  • Linkage can for example via a maleimide group.
  • Preferred placements of cysteine residues for conjugation to stability moieties are distal to the binding site so that the stability moiety doesn’t interfere with binding to the ACE2 coronovirus spike protein (e.g., in the structural domains that facilitate protein folding and binding-competent presentation of the alpha helices and beta hairpin domains to the coronavirus spike protein).
  • Cysteine residues can be added for reasons other than conjugation to stability moieties, for example, two or more cysteine resides can be placed to allow for formation of disulfide bonds.
  • Disulfide bonds can, in some aspects, lend additional stability to the ACE2 protein decoy.
  • the added cysteine residues need not be distal to the binding site and may be, in some aspects, at the N and C termini of the ACE2 protein decoy.
  • included in the present invention are ACE2 protein decoys comprising one or more disulfide bonds.
  • a PEG group is linked to a cysteine residue present in any one of the ACE-2 protein decoys.
  • ACE2 protein decoys having at least 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 95%, 97%, 99% or 100% identity to (i) the amino acid sequence of CTC-648 wherein the cysteine residue at position 11 is present and is linked to a PEG group; (ii) the amino acid sequence of CTC-649 wherein the cysteine residue at position 63 is present and is linked to a PEG group; (iii) the amino acid sequence of CTC-650 wherein the cysteine residue at position 74 is present and is linked to a PEG group; (iv) the amino acid sequence of CTC- 651 wherein the cysteine residue at position 11 is present and is linked to a PEG group; (v) the amino acid sequence of CTC-652 wherein the cysteine residue at position 63 is present and is linked to a PEG group; (vi) the amino acid sequence of CTC-653 wherein the cysteine residue at
  • Linkage can be, for example, via any methodology known in the art, including malimide groups.
  • Chimeric polypeptides that include a de novo protein of the present invention and a heterologous polypeptide can include those chimeric polypeptides comprising a targeting domain.
  • the targeting domain can direct cellular localization of the de novo proteins.
  • the targeting domain can be any suitable polypeptide that binds to one or more targets of interest and can be attached or associated with a polypeptide of the present invention.
  • the targeting domain may include but is not limited to an scFv, a F(ab), a F(ab’)2, a B cell receptor (BCR), a DARPin, an affibody, a monobody, a nanobody, diabody, an antibody (including a monospecific or bispecific antibody); a cell-targeting oligopeptide including but not limited to RGD integrin-binding peptides, de novo designed binders, aptamers, a bicycle peptide, conotoxins, small molecules such as folic acid, and a virus that binds to the cell surface.
  • the targeting domain may be covalently or non-covalently bound to the protein.
  • the targeting domain when present, is a translational fusion with the protein.
  • the protein and the targeting domain may directly abut each other in the translational fusion or may be linked by a polypeptide linker suitable for an intended purpose.
  • exemplary such linkers include, but are not limited, to those disclosed in WO2016178905, WO2018153865, and WO 2018170179 (. Methods of making fusion proteins and conjugates are known in the art and not discussed herein in detail.
  • the de novo ACE2 protein decoys of the present invention are useful for the treatment, and/or prevention of a disease or disorder or condition associated with a coronavirus that uses ACE2 as its receptor, e.g., SARS-CoV or SARS-CoV-2 and/or for ameliorating at least one symptom associated with such disease, disorder or condition.
  • a protein of the present invention may be administered at a therapeutic dose to a patient with a SARS-CoV or SARS-CoV-2 infection.
  • the proteins of the invention are useful to treat subjects suffering from the severe and acute respiratory infection caused by SARS-CoV or SARS- CoV-2.
  • the proteins of the invention are useful in decreasing viral titer or reducing viral load in the host. In one embodiment, the proteins of the present invention are useful in preventing or reducing inflammation in the lung of a subject with SARS-CoV or SARS-CoV-2 or another coronavirus that uses ACE2 as its entry into host cells. In one embodiment, the proteins of the present invention are useful in preventing or reducing interstitial, peribronchiolar or perivascular inflammation, alveolar damage and pleural changes in a subject with SARS-CoV or SARS-CoV-2 or another coronavirus that uses ACE2 as its entry into host cells.
  • proteins of the present invention may be administered to relieve or prevent or decrease the severity of one or more of the symptoms or conditions of the disease or disorder.
  • the proteins may be used to ameliorate or reduce the severity of at least one symptom of an infection from SARS-CoV or SARS-CoV-2 or another coronavirus that uses ACE2 as its entry into host cells including, but not limited to consisting of fever, cough, shortness of breath, pneumonia, diarrhea, organ failure (e.g., kidney failure, heart failure, and renal dysfunction), septic shock and death.
  • organ failure e.g., kidney failure, heart failure, and renal dysfunction
  • proteins of the present invention prophylactically to subjects at risk for developing a coronavirus infection.
  • the subjects are immunocompromised individuals, elderly adults (more than 65 years of age), healthcare workers, persons with occupational or recreational contact with camels or bats, family members in close proximity to coronavirus patient, adults or children with contact with persons with confirmed or suspected coronavirus infection, and patients with a medical history (e.g., increased risk of pulmonary infection, heart disease or diabetes).
  • the present de novo proteins are used for the preparation of a pharmaceutical composition or medicament for treating patients suffering from a coronavirus infection.
  • the present proteins are used as adjunct therapy with any other agent or any other therapy known to those skilled in the art useful for treating or ameliorating a coronavirus infection.
  • the de novo proteins may be combined with other therapies such an any additional therapeutic agent that may be advantageously combined with a protein of the invention.
  • the proteins of the invention may be combined with a second therapeutic agent to reduce the viral load in a patient with a coronavirus infection, or to ameliorate one or more symptoms of the infection.
  • the de novo ACE2 protein decoys can be in any form that allows for them to be administered to a patient. For example, they can be in the form of a solid or liquid.
  • the de novo proteins can be administered by any convenient route, for example by infusion or bolus injection, by absorption through epithelial or mucocutaneous linings (e.g., oral mucosa, rectal and intestinal mucosa, etc.). Administration can be systemic or local. Typical routes of administration include, without limitation, oral, topical, parenteral, sublingual, rectal, vaginal, ocular, intra-tumor, and intranasal. Parenteral administration includes subcutaneous injections, intravenous, intramuscular, intrasternal injection or infusion techniques. In one aspect, the de novo proteins are administered parenterally. In yet another aspect, the de novo proteins are administered intravenously or subcutaneously.
  • the de novo proteins are administered locally.
  • the de novo proteins are administered locally to the respiratory tract, for example, via inhalation. Inhalation can be, for example, by aerosol inhaler or inhalable powder.
  • the proteins of the present invention may be used in combination with an anti- inflammatory drug (e.g., corticosteroids, and non-steroidal anti-inflammatory drugs), an anti-infective drug, or an anti-viral drug.
  • the proteins of the present invention may be used in combination with a drug to treat cytokine release syndrome (e.g., cytokine storm.)
  • cytokine release syndrome e.g., cytokine storm.
  • nucleic acids of the present invention to subjects for the treatment, and/or prevention of a disease or disorder or condition associated with a coronavirus that uses ACE2 as its receptor, e.g., SARS-CoV or SARS-CoV-2 and/or for ameliorating at least one symptom associated with such disease, disorder or condition.
  • a nucleic acid of the present invention may be administered at a therapeutic dose to a patient with a SARS-CoV or SARS-CoV-2 infection PHARMACEUTICAL COMPOSITIONS
  • Pharmaceutical compositions can be formulated so as to allow the de novo ACE2 protein decoys to be bioavailable upon administration of the composition to a patient.
  • the de novo proteins can take the form of solutions, suspensions, emulsion, microparticles, tablets, pills, pellets, capsules, capsules containing liquids, powders, sustained-release formulations, suppositories, emulsions, aerosols, sprays, suspensions, or any other form suitable for use.
  • compositions are, for example, in tablet or powder form.
  • carrier(s) can be liquid, with the compositions being, for example, an oral syrup or injectable liquid.
  • the carrier(s) can be gaseous or particulate, so as to provide an aerosol composition useful in, e.g., inhalatory administration.
  • the de novo proteins are preferably in solid or liquid form, where semi-solid, semi-liquid, suspension and gel forms are included within the forms considered herein as either solid or liquid.
  • the composition can be formulated into a powder, granule, compressed tablet, pill, capsule, chewing gum, wafer or the like form. Such a solid composition typically contains one or more inert diluents.
  • binders such as carboxymethylcellulose, ethyl cellulose, microcrystalline cellulose, or gelatin
  • excipients such as starch, lactose or dextrins, disintegrating agents such as alginic acid, sodium alginate, Primogel, corn starch and the like
  • lubricants such as magnesium stearate or Sterotex
  • glidants such as colloidal silicon dioxide
  • sweetening agents such as sucrose or saccharin, a flavoring agent such as peppermint, methyl salicylate or orange flavoring, and a coloring agent.
  • composition when in the form of a capsule, e.g., a gelatin capsule, it can contain, in addition to materials of the above type, a liquid carrier such as polyethylene glycol, cyclodextrin or a fatty oil.
  • a liquid carrier such as polyethylene glycol, cyclodextrin or a fatty oil.
  • the composition can be in the form of a liquid, e.g., an elixir, syrup, solution, emulsion or suspension.
  • the liquid can be useful for oral administration or for delivery by injection.
  • a composition can comprise one or more of a sweetening agent, preservatives, dye/colorant and flavor enhancer.
  • compositions for administration by injection one or more of a surfactant, preservative, wetting agent, dispersing agent, suspending agent, buffer, stabilizer and isotonic agent can also be included. Also contemplated are delayed release capsule, including those with an enteric coating.
  • the liquid compositions can also include one or more of the following: sterile diluents such as water for injection, saline solution, preferably physiological saline, Ringer's solution, isotonic sodium chloride, fixed oils such as synthetic mono or digylcerides which can serve as the solvent or suspending medium, polyethylene glycols, glycerin, cyclodextrin, propylene glycol or other solvents; antibacterial agents such as benzyl alcohol or methyl paraben; antioxidants such as ascorbic acid or sodium bisulfate; chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose.
  • sterile diluents such as water for injection, saline solution, preferably physiological saline, Ringer's solution, isotonic sodium chloride
  • fixed oils such as synthetic mono or digylcer
  • a parenteral composition can be enclosed in ampoule, a disposable syringe or a multiple-dose vial made of glass, plastic or other material.
  • Physiological saline is an exemplary adjuvant.
  • An injectable composition is preferably sterile.
  • the present disclosure provides pharmaceutical compositions, comprising one or more proteins of the disclosure and a pharmaceutically acceptable carrier.
  • carrier refers to a diluent, adjuvant or excipient, with which de novo protein of the present invention is administered.
  • the pharmaceutical composition may comprise, for example, in addition to the polypeptide of the disclosure (a) a lyoprotectant; (b) a surfactant; (c) a bulking agent; (d) a tonicity adjusting agent; (e) a stabilizer; (f) a preservative and/or (g) a buffer.
  • the buffer in the pharmaceutical composition is a Tris buffer, a histidine buffer, a phosphate buffer, a citrate buffer or an acetate buffer.
  • the pharmaceutical composition may also include a lyoprotectant, e.g. sucrose, sorbitol or trehalose.
  • the pharmaceutical composition includes a preservative e.g.
  • the pharmaceutical composition includes a bulking agent, like glycine.
  • the pharmaceutical composition includes a surfactant e.g., polysorbate- 20, polysorbate-40, polysorbate- 60, polysorbate-65, polysorbate-80 polysorbate-85, poloxamer-188, sorbitan monolaurate, sorbitan monopalmitate, sorbitan monostearate, sorbitan monooleate, sorbitan trilaurate, sorbitan tristearate, sorbitan trioleaste, or a combination thereof.
  • the pharmaceutical composition may also include a tonicity adjusting agent, e.g., a compound that renders the formulation substantially isotonic or isoosmotic with human blood.
  • Exemplary tonicity adjusting agents include sucrose, sorbitol, glycine, methionine, mannitol, dextrose, inositol, sodium chloride, arginine and arginine hydrochloride.
  • the pharmaceutical composition additionally includes a stabilizer, e.g., a molecule which, when combined with a protein of interest substantially prevents or reduces chemical and/or physical instability of the protein of interest in lyophilized or liquid form.
  • Exemplary stabilizers include sucrose, sorbitol, glycine, inositol, sodium chloride, methionine, arginine, and arginine hydrochloride.
  • the proteins may be the sole active agent in the pharmaceutical composition, or the composition may further comprise one or more other active agents suitable for an intended use.
  • the proteins are provided in a therapeutically effective amount. This refers to an amount of the protein effective for treating the disease or having the desired effect.
  • the data obtained from the cell culture assays and animal studies can be used in formulating a range of dosage for use in humans. Dosage regimens can be adjusted by clinicians to provide the optimum desired response (e.g., a therapeutic or prophylactic response).
  • the compositions can be administered one from one or more times per day to one or more times per week; including once every other day.
  • An exemplary dosage range for the de novo ACE2 protein decoys may, for instance, be 0.1 ⁇ g/kg-100 mg/kg body weight; alternatively, it may be 0.5 ⁇ g/kg to 50 mg/kg; 1 ⁇ g/kg to 25 mg/kg, or 5 ⁇ g/kg to 10 mg/kg body weight.
  • the recommended dose could be lower than 0.1 mcg/kg, especially if administered locally. In other embodiments, the recommended dose could be based on weight/m 2 (i.e. body surface area), and/or it could be administered at a fixed dose (e.g., .05-100 mg). In some aspects, the fixed dose will be 1, 3, 10 or 20 mg doses.
  • the polypeptides can be delivered in a single bolus, or may be administered more than once (e.g., 2, 3, 4, 5, or more times) as determined by an attending physician. [00193]
  • the adminstriaton will be via intransal spray and dosing will be fixed dose. In some aspects, the fixed dose will be 0.5, 1, 2, 3, 10 or 20 mg doses.
  • Computation design of CTC-445 Computational design of ACE2 mimetics took place in four phases: selection of structural motifs from ACE2 to mimic, design of polypeptide backbones to support those structural motifs, design of sequences to support the generated backbones, and filtering of designs to reduce the set of designs to be tested experimentally.
  • ACE2 structural motifs to mimic Structural motifs were obtained and computational design and analysis were performed using one or more of the following structures: 1) the crystal structure of ACE2 bound to a chimeric SARS-CoV + SARS-CoV- 2 RBD (PDB ID: 6vw1); 2) the cryo-EM structure of ACE2 bound to SARS-CoV-2 RBD (PDB ID: 6m17); or 3) the cryo-EM structure of ACE2 bound to SARS-CoV RBD (PDB ID: 6cs2).
  • Two RBD-binding helices from ACE2 were identified and extracted from the structures; one (“H1”) spanning residues 19-53; and the other (“H2”) spanning residues 55- 84.
  • the beta-hairpin motif spanning residues 346-360 was also included in the set of motifs to mimic. These structural motifs were used as the starting point for backbone design. From these motifs, a set of residues to be preserved in the final sequence designs was selected: 19, 23, 24, 27, 28, 30, 31, 34, 35, 37, 38, 41, 42, 45, 61, 64, 68, 72, 75, 76, 79, 82, 83, 352, 353, 354, 355 and 357. Since these structural motifs alone are not well-supported by other secondary structure elements from ACE2, de novo secondary structures were placed against the binding motifs.
  • These new secondary structures provide the designed proteins with a hydrophobic core and serve to stabilize the relative position and orientation of the binding motifs in a manner that is competent for binding.
  • the supporting structures were either computationally generated and placed by an available method (e.g. Rosetta combinatorial fragment assembly, parametric generation, etc.) or extracted and copied from existing structures.
  • Rosetta combinatorial fragment assembly e.g. Rosetta combinatorial fragment assembly, parametric generation, etc.
  • extracted and copied from existing structures e.g. Rosetta combinatorial fragment assembly, parametric generation, etc.
  • the initial position and orientation of the helix was determined by visual inspection (using pymol ) and refined by a montecarlo search to identify the optimal location.
  • the core elements i.e., the starting motifs and supporting secondary structure
  • the core elements were rebuilt by identifying parametric equations of repetitive phi and psi angles (omega fixed to 180°) that result in secondary structures that recapitulated each of the target helices as close as possible, a “pitch” on the phi and psi angles was allowed every 3rd residue in order to allow the helices the possibility to have curvature.
  • the algorithm can vary the length of each of the core elements up to +/- 12 amino acids (compared to the input structural motifs).
  • the mimetic building protocol aims to reconnect the idealized elements by pairs in all possible combinations.
  • the loop database was filtered to identify the loops that could connect the pair of secondary structure elements.
  • the termini of the loop were superimposed to the termini of the core elements and were required to be ⁇ 1.2 A RMSD from the core element termini, with each individual loop terminus required to have ⁇ 1.5 A RMSD from the corresponding individual core element terminus.
  • the core elements were minimized by cartesian-constrained backbone minimization.
  • the solutions are verified to contain highly ideal fragments (i.e. that every overlapping fragment that composes the two connected elements is also contained within the database) and that no backbone clashes with the target (context) binding partner.
  • Successful pairwise connected core elements were then profiled using the same database of fragments in order to determine the most probable amino acids at each position (this information was encoded as metadata on each design).
  • solutions for pairs of connected secondary structures were combinatorially combined (by using graph theory connected components) to produce fully connected backbones. Since the number of solutions grows exponentially with each pair of elements, at each fragment combination step we ranked the designs to favor those with shorter interconnections between pairs of secondary structure core elements (i.e. effectively with shorter loops), and kept only the top solutions.
  • CTC-445 Yeast display screening EB100 yeast were transformed with genes encoding the proteins to be displayed together with a linearized pETcon3 vector. The vector was linearized by 100-fold overdigestion by NdeI and XhoI (New England Biolabs) and then purified by gel extraction (Qiagen).
  • TCEP was added to 1 mg of sample at a 10x molar excess. Incubation was at 37 degrees Celsius for 30 minutes. Maleimide-PEG (20K linker and 40K branched) was dissolved in PBS and added to the sample at a 10x fold excess and incubated for 12 hours. Unreacted maleimides were removed and purification was via SEC.
  • FIG. 2A-F show binding of a positive, negative control, and CTC-445 to SARS-CoV-2 Spike Protein via yeast display and 3A-D show that control human ACE2 and de novo protein CTC-445 were able to bind to 2019-nCoV Spike/RBD Protein and compete for binding with soluble ACE2. ⁇
  • Example 3 Characterization of De Novo ACE2 Protein Decoys [00205] CTC-445 variants were characterized for expression levels (0-5), solubility (0-5) and binding affinity via biolayer interferometry (0-5).
  • SEC-MALS Size exclusion chromatography with multi-angle light scattering
  • Protein samples were prepared in a PBS buffer at concentrations ranging from 2 to 4 mg/mL and filtered with 0.22 ⁇ m syringe filters; 100 ⁇ L of the samples were injected and run at a flow rate of 0.5 mL/min. Data were analyzed using the software ASTRA 7 (Wyatt Technologies). SEC-MALS demonstrated that CTC-445 variants, CTC-632, 633, 634, 642, 643, and 644 were mostly aggregated or oligomeric and CTC-635, 636, 637, 639, 640, 641,643, 644, 646, 648, 649, 650, 651, 652, and 653 were mostly monomeric (data not shown). Data is shown in table below. [00207] Biolayer Interferometry.
  • Octet binding assay of purified ACE2 protein decoys Binding data were collected in an Octet RED96 (ForteBio) and processed using the instrument’s integrated software using a 1:1 binding model.
  • SARS-CoV-2 Spike Protein (RBD domain only, mFc Tag, Sino Biological) were immobilized to Protein A sensors (ProtA, ForteBio) at 2 ⁇ g ml ⁇ 1 in binding buffer (10 mM HEPES, pH 7.4, 150 mM NaCl, 3 mM EDTA, 0.05% surfactant P20, 0.5% non-fat dry milk).
  • Example 4 Binding kinetics and stability of de novo protein decoys CTC-445, CTC-445.2 and CTC-445.2d
  • the thermal stability of CTC-445 and select variants was measured using circular dichroism.
  • Far-ultraviolet circular dichroism measurements were carried out with an CHIRASCAN spectrometer V100 (Applied Photophysics) in PBS buffer (pH 7.4) in a 0.1 mm path-length cuvette with protein concentration of 0.2 mg ml ⁇ 1 (unless otherwise mentioned in the text).
  • Temperature melts were obtained from 20 to 95 °C and monitored absorption signal at 222 nm (steps of 0.5 °C per min, 30 s of equilibration by step).
  • Serial thermal ramping protein stability Serial thermal ramping assays were performed in order to test the unfolding reversibility (thermal recovery) of CTC-445, CTC- 445.2, CTC-445.2d and hACE2 (Sino Biological); to this end, the UNCLE platform (Unchained Labs) was used.8.8 ⁇ L of CTC-445, CTC-445.2 and CTC-445.2d at 0.5 mg/mL were loaded in capillary cuvettes and fluorescence spectra ( ⁇ 300-400 nm) were measured for each sample while temperature was increased and decreased repeatedly.
  • Example 5 – CTC-445 inhibits SARS-CoV-2 Spike Protein RBD binding to human ACE2
  • SARS-CoV2 S Protein RBD was coated at 0.5 ⁇ g/mL (100 ⁇ L/well) in flat, clear bottom, high binding polystyrene 96-well plate (Thermo, Cat#15041) overnight at 4C. On the following day, assay plate was washed 3X with 0.05% Tween-PBS and blocked with 2% BSA/0.05%Tween-PBS for 1-1.5 hour at 37C.
  • test article dilution was added to plate at 50 ⁇ L/well followed by 50 ⁇ L/well of biotin-hACE2 (0.03 ⁇ g/mL final). Incubation of test articles with biotin-hACE2 was performed for 1 hour at 37C. Afterwards, the plate was washed (3X) followed by incubation of 0.1 ⁇ g/mL (100 ⁇ L/well) streptavidin-HRP for 1 hour at 37C. The plate was washed (3X) a final time before addition of 100 ⁇ L/well TMB (abCam, Cat#171524).
  • TMB was incubated for 7 minutes and reaction was stopped with addition of 50 ⁇ L/well 1M HCl. Mean absorbance in each well was measured from 4 spots at 450 nm.
  • the IC 50 results are shown in Table 6 below: Table 6 Example 6 – Select CTC-445 variants inhibit SARS-CoV-2 Spike Protein RBD binding to human ACE2 [00212] SARS-CoV2 S Protein RBD was coated at 0.5 ug/mL (100ul/well) in flat, clear bottom, high binding polystyrene 96-well plate (Thermo, Cat#15041) overnight at 4C.
  • assay plate was washed 3X with 0.05% Tween-PBS and blocked with 2% BSA/0.05%Tween-PBS for 1-1.5 hours at 37C.
  • Three-fold 6-point serial dilution of each test article was prepared starting at 4uM (2X final) for CTC-445 variants shown in Table 7 below and at 360nM (2X final) for positive control inhibitor (Acro).
  • a ten-fold dilution was prepared for the last two concentrations rather than three- fold.
  • a dose response of biotin-hACE2 was prepared by performing a three-fold serial dilution of biotin-hACE2 starting at 0.1 ⁇ g/mL (1.15nM).
  • biotin-hACE2 From the 0.1 ⁇ g/mL biotin- hACE2, a constant concentration of 0.066 ⁇ g/mL (2X final) was prepared to be used for inhibition by test articles. After blocking, assay plate was washed (3X) and each test article dilution was added to plate at 30 ul/well (in duplicate wells) followed by 30ul/well of biotin-hACE2 (0.033 ⁇ g/mL final). Incubation of test articles with biotin-hACE2 was performed for 1 hour at 37 C. Afterwards, plate was washed (3X) followed by incubation of 0.1 ⁇ g/mL (100 ul/well) streptavidin-HRP for 1 hour at 37C.
  • TMB 100ul/well TMB (abCam, Cat#171524). TMB was incubated for 7 minutes and reaction was stopped with addition of 50ul/well 1M HCl or equivalent. Mean absorbance in each well was measured from 4 spots at 450nm.
  • IC 50 results are shown in Table 7 below: Table 7 Example 7 – Select PEGylated and non-PEGylated CTC-445 variants inhibit SARS-CoV-2 Spike Protein RBD binding to human ACE2 [00213] SARS-CoV2 S Protein RBD was coated at 0.5 ⁇ g/mL (100ul/well) in flat, clear bottom, high binding polystyrene 96-well plate (Thermo, Cat#15041) overnight at 4C. On the following day, assay plate was washed 3X with 0.05% Tween-PBS and blocked with 2% BSA/0.05%Tween-PBS for 1-1.5 hours at 37C.
  • test article dilution was added to plate at 30 ul/well (in duplicate wells) followed by 30ul/well of biotin-hACE2 (0.033 ⁇ g/mL final). Incubation of test articles with biotin-hACE2 was performed for 1 hour at 37C. Afterwards, plate was washed (3X) followed by incubation of 0.1 ⁇ g/mL (100 ul/well) streptavidin-HRP for 1 hour at 37C. Plate was washed (3X) a final time before addition of 100ul/well TMB (abCam, Cat#171524). TMB was incubated for 7 minutes and reaction was stopped with addition of 50ul/well 1M HCl or equivalent. Mean absorbance in each well was measured from 4 spots at 450nm. (Note: starting concentration for CTC654 and CTC655
  • SARS-CoV2 S protein receptor-binding domain (Acro Biosystems, Cat#EP- 105) or SARS-CoV-1 S Protein RBD (Acro Biosystems, Cat#SPD-S52H6) was coated at 0.5 ⁇ g/mL (100ul/well) in flat, clear bottom, high binding polystyrene 96-well plate (ThermoFisher, Cat#15041) overnight at 4°C.
  • each assay plate was washed 3X with 0.05% Tween-PBS and blocked with 2% BSA/0.05%Tween-PBS for 1-1.5 hours at 37°C.
  • 12-point serial dilutions of each test article were prepared at a concentration 2-fold higher than final.
  • a 3-fold serial dilution was performed starting at 4 ⁇ M.
  • human ACE2 (SinoBiological, Cat#10108- H08B)
  • a 3-fold serial dilution was performed starting at 0.8 ⁇ M.
  • a dose response of biotin-hACE2 was prepared by performing a two-fold serial dilution starting at 0.174 ⁇ g/mL (2 nM).
  • biotin-hACE2 From the 0.174 ⁇ g/mL biotin-hACE2, a constant concentration of 0.07 ⁇ g/mL (0.8 nM, 2X final) was prepared to be used for inhibition by test articles. After blocking, each assay plate was washed (3X) and each test article dilution was added to plate at 50 ⁇ L/well (single replicate well per concentration) followed by 50 ⁇ L/well of biotin- hACE2 (0.035 ⁇ g/mL final). Incubation of test articles with 0.4 nM biotin-hACE2 was performed for 1 hour at 37°C.
  • each assay plate was washed (3X) followed by incubation of 0.1 ⁇ g/mL (100 ⁇ L/well) streptavidin-HRP for 1 hour at 37°C. Finally, each plate was washed (3X) before addition of 100 ⁇ L/well TMB (abCam, Cat#171524). TMB was incubated for 7-8 minutes and reaction was stopped with addition of 50 ⁇ L/well TMB Stop Solution (abCam, Cat# ab171529). Mean absorbance in each well was measured from 4 spots at 450nm. [00215] As shown in Table 9, all of the tested constructs inhibited SARS-CoV-2 spike protein binding to human ACE2.
  • CTC654 also inhibited SARS-CoV-1 spike protein binding to human ACE2.
  • Example 9 –CTC-445 variants inhibit SARS-CoV-2 Spike Protein RBD binding to human ACE2
  • Additional CTC-445 variants were assayed for inhibition of SARS-Cov-2 spike protein binding to human ACE2, substantially as described above.
  • tested constructs inhibited SARS-Cov-2 spike protein binding to human ACE2 with an IC 50 of less than 30 nM, and in most instances, less than 15 nM. See also Figure 19C for CTC- 708.
  • Table 10A Inhibition of spike protein binding to ACE2
  • Additional protein decoys were assayed for inhibition of SARS-Cov-2 spike protein binding to human ACE2 as follows: [00218] SARS-CoV-2 S protein (D614G), His Tag, Super stable (Acro Biosystems, Cat#SPD-C52H3) was coated at 0.5 ⁇ g/mL (50ul/well) in flat, white bottom, high binding polystyrene 96-well plate (LumiNunc MaxiSorp Thermo Scientific, Cat#437796) overnight at 4°C.
  • each assay plate was washed 4X with 0.05% Tween-PBS and blocked with 2% BSA/0.05%Tween-PBS for 1-1.5 hours at 21°C on a shaker set to 600rpm. Twelve-point serial dilutions of each test article were prepared at a concentration 2-fold higher than final. For the test articles, a two-fold serial dilution was performed starting at 220 nM and ending at 0.107 nM. In addition, a dose response of biotin-hACE2 was prepared by performing a two-fold serial dilution starting at 600 ng/mL (68.81 nM, 2X final) and ending at 0.585 ng/ml (6.71 pM).
  • biotin-hACE2 From the 600 ng/mL biotin-hACE2, a constant concentration of 80 ng/mL (0.92 nM, 2X final) was prepared to be used for inhibition by each test article dilution. Each test article dilution was added to a U-bottom polystyrene plate at 90 ⁇ L/well followed by 90 ⁇ L/well of biotin-hACE2 (0.46 nM final) and mixed well. The biotin-hACE2 dose response was mixed with equal volumes of assay buffer. After blocking, each assay plate was washed (4X) and test article dilutions were added to plate in duplicate wells at 50 ⁇ L/well.
  • Table 10C [00220] In order to determine an optimal amino acid linker length between ACE2 protein decoy units in a bivalent ACE-2 protein decoy comprising a serially duplicated version of CTC-726, a 10 amino acid, 20 amino acid, 30 amino acid, 50 amino acid and 60 amino acid GS linker was used to link the 2 decoy units. IC50s were determined as provided above. The protein decoys were expressed and purified and tested for binding via Octet as described herein and were determined to have an estimated Kd of ⁇ 1 nM. Table 10D Example 10 – Potency of CTC-445 variants vs. molecular weight.
  • IC50 values for CTC-445 variants for binding to SARS-CoV-2 S/RBD were measured by ELISA in the presence of 0.033 nM biotinylated soluble hACE2.
  • hACE2 bound to RBD was quantified by treatment with streptavidin-HRP and measurement of absorbance at 450 nm.
  • SARS-CoV2 S Protein RBD (Acro, Cat#EP-105) was coated at 0.5 ⁇ g/mL (100 ⁇ L/well) in flat, clear bottom, high binding polystyrene 96-well plate (Thermo, Cat#15041) overnight at 4°C.
  • each assay plate was washed 3X with 0.05% Tween- PBS and blocked with 2% BSA/0.05%Tween-PBS for 1-1.5 hours at 37°C.
  • 11-point four-fold serial dilutions of each test article were prepared starting at 2 ⁇ M (2X final).
  • a dose response of biotin-hACE2 was prepared by performing a two-fold serial dilution starting at 0.174 ⁇ g/mL (2 nM). From the 0.174 ⁇ g/mL biotin-hACE2, a constant concentration of 0.07 ⁇ g/mL (0.8 nM, 2X final) was prepared to be used for inhibition by test articles.
  • each assay plate was washed (3X) and each test article dilution was added to the plate at 50 ⁇ L/well (single replicate well per concentration) followed by 50 ⁇ L/well of biotin-hACE2 (0.035 ⁇ g/mL final). Incubation of test articles with 0.4 nM biotin-hACE2 was performed for 1 hour at 37°C. Afterwards, each assay plate was washed (3X) followed by incubation of 0.1 ⁇ g/mL (100 ⁇ L/well) streptavidin-HRP for 1 hour at 37°C. Lastly, each plate was washed (3X) before addition of 100 ⁇ l/well TMB (abCam, Cat#171524).
  • TMB was incubated for 7-8 minutes and reaction was stopped with addition of 50 ⁇ L/well TMB Stop Solution (abCam, Cat# ab171529). Mean absorbance in each well was measured from 4 spots at 450 nm. The results are shown in Figure 11.
  • Example 11 VSV-luc pseudovirus neutralization
  • Neutralization activity was determined using a non-replicative VSV pseudovirus with a firefly luciferase gene and expressing the spike protein from Wuhan-1 SARS-CoV-2 isolate (GenBank: QHD43416.1)
  • Neutralization assays were performed with either SARS- CoV-2 / VSV-Luc (expressing the full ectodomain of the SARS-CoV-2 spike protein on the surface of the pseudovirus) or VSVg / VSV-Luc (expressing VSVg).
  • Thirty thousand 293T- ACE2 cells were seeded the day before in white plates in the presence of hygromycin (100 ⁇ g/mL).
  • Test-items were evaluated in duplicates using serial 3-fold dilutions.
  • Controls included cells infected with a VSV pseudovirus lacking spike protein (NoEnv / VSV-Luc), with pseudovirus carrying VSVg (VSVg / VSV-Luc), or uninfected cells (“mock-infected”). Pseudovirus-infected cells were incubated with test-item or vehicle alone (DMEM5).
  • the average percentage infectivity for each concentration together with the standard deviation of duplicates was obtained from duplicate values generated.
  • the 50% inhibitory concentration (IC50) was determined with GraphPad Prism. The IC50 was defined as the concentration of test-item at which the relative luminescence units (RLUs) were reduced by 50% as compared with the values in cells infected with pseudovirus in the absence of test-item).
  • RLUs relative luminescence units
  • a cell viability assay (right) was run in parallel. The assays were performed using CTC-445.2 and CTC-445.2d ( Figures 12A-B) and CTC-445.2d and CTC- 445.3d ( Figure 21A-C).
  • CTC 445.2, CTC-445.2d and CTC-445.3d neutralized VSV pseudovirus expressing SARS-CoV-2.
  • No inhibition of pseudovirus infection in HEK293T cells expressing VSVg with CTC-445.2d and CTC-445.3d was observed demonstrating specificity for the RBD of SARS-CoV-2, with no discernable loss of HEK293 cell viability.
  • Example 12 Pharmacokinetics of the de novo decoys [00226] Eight-week-old Balb/c mice (Charles River) were anesthetized with isoflurane and 30 ⁇ L of CTC-445.2d was delivered intranasally. Mice were euthanized at indicated time points and whole blood and lungs isolated.
  • Lung lysate was prepared through mechanical disruption of tissue using Precellys tissue homogenizer (Bertin) followed by lysis with T-PER tissue lysis buffer (ThermoFisher) containing protease/phosphatase inhibitor cocktail (ThermoFisher). Lysates were cleared by centrifugation and frozen for analysis. Standard 96-well MSD (L15XA) plate was pre-coated with 50 ⁇ L of 0.5 ⁇ g/mL SARS-CoV-2 S Protein RBD tagless (Sino, 40592-VNAH) overnight at 4 °C and sealed. Dilution of capture reagent was prepared in PBS.
  • assay plate was washed (6X) and 50 ⁇ L/well of each sample or standard dilution was added. Assay plate was incubated for additional 1 hour at RT with plate shaking followed by washing (6X).
  • a 1-step detection method was used. With 1-step detection, rabbit anti-His mAb (RevMab, 31-1048-00) and SULFOTAG goat anti-rabbit IgG (H+L) pAb (MSD, R32AB) were pre-mixed at 1 ⁇ g/mL each for 1 hour prior to addition to assay plate at 50 ⁇ L/well. After last wash, 150 ⁇ L/well of MSD Gold Read Buffer was added and luminescence measured on the MSD Quickplex SQ120.
  • Luminescence was converted to concentration based on a standard curve.
  • Figure 13 shows bioavailability of CTC-445.2d in mice lung (top) and plasma (bottom) after intranasal administration. Protein concentration in lung lysates and blood plasma quantified using Meso Scale Discovery platform. The results show high persistence of fully functional CTC-445.2d in the lungs for more than 24 hours. Despite using an intranasal administration route, CTC-445.2d was detected in the blood raising the possibility that intranasal delivery might also allow for some degree of systemic exposure to the protein.
  • Example 13 ACE2 functional assay.
  • FIG. 14 shows ACE2 functional activity as measured by enzymatic release of a free fluorophore from Mca-APK(DNP) substrate. ACE2 inhibition was shown using DX 6 00 peptide as a positive control. This assay indicates that the de novo designed proteins do not significantly affect the functional activity of ACE2.
  • Example 14 SARS-1 binding. [00228] Binding kinetics of CTC-445.2 and CTC-445.2d to SARS-CoV-1 RBD was measured by Octet as described herein.
  • CTC-445.2 (left) and CTC-445.2d (right) were incubated with SARS-CoV-1 S RBD immobilized to Anti-Penta_HIS sensors and incubated with varying concentrations of CTC-445.2 (123-10000 nM; left) and CTC-445.2d (82-6667 nM; right).
  • Example 15 Kinetics of binding for CTC-445.2 to SARS-CoV-2 RBD mutants.
  • Five SARS-CoV-2 RBD mutants were immobilized via Anti-Penta-His sensors, and CTC-445.2 was titrated in solution. Data was globally fit to a 1:1 model for each RBD mutant.
  • Figure 16 demonstrates that CTC-445.2 binds to the mutants.
  • Example 16 – Cytoprotection Assay [00230] The test compounds in PBS were prepared at eight concentrations in MEM solution with 50 ⁇ g/mL gentamicin and 2% FBS.
  • Test materials CTC-640 and CTC-641 were evaluated using a high test concentration of 50 ⁇ M and seven serial three-fold dilutions.
  • CTC-655 was evaluated using a high test concentration of 10 ⁇ M and seven serial three-fold dilutions.
  • the remaining test articles were evaluated using a high test concentration of 20 ⁇ M and seven serial three-fold dilutions.
  • One hundred microliters of each concentration were added in triplicate wells for efficacy and in duplicate wells for cytotoxicity. Each dilution was added to 5 wells of a 96-well plate with 80-100% confluent Vero76 cells (3 x 104 cells per well) and incubated at 37°C/5% CO2 for 2 hours.
  • SARS-CoV-2 virus (strain USA-WA1/2020) was prepared to achieve the lowest possible multiplicity of infection (MOI; 0.002) that would yield >80% cytopathic effect (CPE) at four days.
  • MOI multiplicity of infection
  • untreated virus control wells reached maximum cytopathic effect (CPE) following four days' infection, plates were stained with neutral red dye for approximately 2 hours, then supernatant dye was removed and the incorporated dye was extracted in 50:50 Sorensen citrate buffer/ethanol, then read on a spectrophotometer.
  • sequence for the fusion protein is as shown in SEQ ID NO:247: MGWSCIILFLVATATGVHSEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVT CVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWE SNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSL SPGKGGSGGGSGPGSVEIDLGKGDFREIRASEDAREAAEALAEAARAMKEALEILREIAEKLR DSSRASEAAKRIAKAIRKAADAIAEAAKIAARAAKDGDAARNAENAARKAKEFAEEQAKLA DMYAELAKNGDKSSVLEQLKTFADKAFHEMEDLFYQAALAVFE
  • CTC-702 was fused to the C-terminus of one IgG1 Fc domain and the N terminus of another.
  • the fusion protein was expressed and purified and tested for binding via Octet as described herein. The fusion showed comparable binding to the non-fused protein decoy.
  • Yeast was grown in C-Trp-Uraselective medium and later induced for 12- 18 h in SGCAA medium.
  • Cells were incubated with varying concentrations of SARS-CoV- 2 RBD mFc diluted in PBSF (50mM NaPO4, 150mM NaCl, pH 7.4), with decreasing RBD concentrations at every round of sorting: 200 nM (round 1, 108 cells sorted), 10 nM (round 2, 107 cells sorted), 1 nM (round 3, 106 cells sorted), 100 pM (round 4, 106 cells sorted), and 10 pM and 1 pM (round 5, 106 cells sorted). This incubation was done on ice for 30 min, and later washed with chilled PBSF.
  • Example 19 Binding specificity of CTC445.2d assessed using a comprehensive human proteome binding assay.
  • CTC445.2d at 1 ⁇ g/mL ( 30 nM) chemically labeled with Alexa-647 was used for the protein-protein interaction assay.
  • Z scores were computed using the average Z score of the duplicate spots of a given protein (each protein is printed in duplicate on a HuProt TM array).
  • S score is the difference of the Z Scores of a given protein and the one ranked after it. If the S score of the top hit is > 3 from the next hit, the test protein is considered as highly specific against that hit. [00236] Table 13 shows all hits with Z Score > 10. CTC445.2d demonstrates specific binding to the SARS-CoV-2 RBD, with a Fluorescence value of 65,535 translating to the highest Z Score of 88.6. The S Score to the next highest binding gene, QDPR, is 27.6, demonstrating highly specific binding to the intended target RBD of SARS-CoV-2.
  • Enrichment values were then converted to a positional probability score for each mutation at each position, and plotted as a sequence logo using logomaker [ref: https://www.biorxiv.org/content/10.1101/635029v1]. Letters are scaled according to their probability and ordered from highest probability (top) to lowest (bottom). The native sequence of CTC445.2 is shaded in black. The skilled artisan can use the results to guide the selection of preferred and non-preferred substitions at various positions for the ACE2 protein decoys.
  • a clinical design for a De Novo ACE2 Protein Decoy is designed to identify the safety profile, recommended dose and schedule, and clinical activity of the De Novo ACE2 Protein Decoy.
  • the route of administration is local administration to the respiratory tract (including nasopharynx and lungs) by inhalation of nebulized De Novo ACE2 Protein Decoy, and/or systemic administration intravenously.
  • patient population are COVID-19 patients with high risk of clinical morbidity or mortality from SARS-CoV-2, e.g. who are hospitalized and who require oxygen supplementation for hypoxia.
  • the trial may avoid recruitment of patients who require positive pressure ventilation.
  • the trial is conducted in two parts.
  • the first part is a dose-escalation study intended to investigate the safety of the De Novo ACE2 Protein Decoy in hospitalized hypoxic COVID-19 patients, and to identify the recommended part 2 dose and schedule (RP2DS).
  • the second part is a cohort-expansion study intended to explore the clinical activity of the De Novo ACE2 Protein Decoy in hospitalized hypoxic COVID-19 patient when administered at the RP2DS.
  • Endpoints include e.g. survival, requirement for positive pressure ventilation, time to recovery and/or discharge, and clinical safety profile.
  • the patient population could be individuals at risk of coronavirus infection and administration could be prophylactic.
  • SARS-CoV-2 invades host cells in a two-step process. First, its S protein RBD attaches to the cell by binding to hACE2, a membrane associated protein, triggering endocytosis.
  • the virus escapes the endosome via a protease-cleavage- mediated fusion peptide.
  • the process is similar to the beta-coronaviruses HCoV-NL63 and SARS-CoV-1, which also target hACE2 for cellular entry. In principle, inhibiting the viral interaction with hACE2 should prevent infection.
  • the design strategy was applied to engineer, validate and optimize de novo hACE2 decoys to neutralize SARS-CoV-2. [00242] Approximately 35,000 plausible computational models of de novo ACE2 decoys were engeinerred. By using yeast display, 196 of the top ranked designs were individually tested for binding.
  • the low binding affinity and potency of CTC-445 are likely the result of instability of its folded state ( ⁇ GNI ⁇ -2.7 kcal mol -1 , Tm ⁇ 75.3 °C).
  • CTC-445.2 is predominantly monomeric, thermodynamically hyperstable ( ⁇ GNI ⁇ - 5.0 kcal mol -1 , T m ⁇ 93 °C), exhibits low nanomolar affinity for the RBD of SARS-CoV-2 (K D ⁇ 21.0 nM), has improved cross-reactivity to SARS-CoV-1 (K D ⁇ 7.1 ⁇ M), and can efficiently compete hACE2 binding to the SARS-CoV-2 RBD (IC 50 @ ACE2[0.4nM] ⁇ 10.4 nM).
  • the sequence of CTC-445.2 has no significant identity with hACE2 (either in terms of linear sequence alignment or structural sequence alignment, ClustalW ⁇ 22% , MICAN ⁇ 34%, respectively, Fig.
  • CTC-445.2d (Fig. 2A), a bivalent version of CTC-445.2, has a ⁇ 10-fold improvement in binding affinity for both SARS-CoV-2 RBD (K D ⁇ 3.5 nM) and SARS-CoV-1 RBD (K D ⁇ 587 nM), and a similar increase in its ability to compete with hACE2 binding to SARS-CoV-2 RBD (IC 50 @ ACE2[0.4nM] ⁇ 700 pM).
  • CTC-445.2d As designed, the binding interface of the SARS-CoV-2 RBD with CTC-445.2 closely mirrors the target hACE2 interface.
  • CTC-445.2 has a large binding advantage over ACE2 for many of the RBD mutations, likely a result of both its higher stability and smaller size.
  • the high and specific binding affinity of the optimized de novo protein decoys translated into effective and specific in vitro neutralization of SARS-CoV-2 viral infection.
  • the presence of the de novo decoys showed no impact on mammalian cell viability or on the enzymatic activity of hACE2.
  • the de novo decoys were able to fully neutralize viral infection in in vitro systems of cell infection.
  • the de novo protein design approach to generate decoys is orthogonal to traditional therapeutics and has the potential to better overcome the problem of mutational viral evasion.
  • Natural proteins repurposed often present significant challenges for development as therapeutics, such as low stability that can complicate manufacturing, transport and storage; residual/undesirable biological activity; and the risk of eliciting an autoimmune response.
  • the de novo protein decoys are easy to manufacture (i.e. in traditional bacterial systems) and their thermodynamic hyperstability can enable simplified transport and storage, possibly without the need of a cold chain.
  • the de novo decoy’s resilience to viral escape is believed to be a unique feature of the described design strategy.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Engineering & Computer Science (AREA)
  • Public Health (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Animal Behavior & Ethology (AREA)
  • Veterinary Medicine (AREA)
  • Molecular Biology (AREA)
  • Wood Science & Technology (AREA)
  • Virology (AREA)
  • Biochemistry (AREA)
  • Epidemiology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Microbiology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Oncology (AREA)
  • Communicable Diseases (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Toxicology (AREA)
  • Biophysics (AREA)
  • Cell Biology (AREA)
  • Peptides Or Proteins (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Vascular Medicine (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

L'invention concerne des leurres protéiques de novo de ACE2 et leurs utilisations. Selon certains modes de réalisation, l'invention concerne des procédés de traitement d'une infection à coronavirus à l'aide des leurres protéiques de ACE2.
EP21721753.8A 2020-04-07 2021-04-06 Leurres protéiques de novo de l'enzyme 2 de conversion de l'angiotensine (ace2) Withdrawn EP4132964A2 (fr)

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
US202063006463P 2020-04-07 2020-04-07
US202063028401P 2020-05-21 2020-05-21
US202062705150P 2020-06-13 2020-06-13
US202063055051P 2020-07-22 2020-07-22
US202063060489P 2020-08-03 2020-08-03
US202063094179P 2020-10-20 2020-10-20
US202163145352P 2021-02-03 2021-02-03
PCT/US2021/025974 WO2021207207A2 (fr) 2020-04-07 2021-04-06 Leurres protéiques de novo de l'enzyme 2 de conversion de l'angiotensine (ace2)

Publications (1)

Publication Number Publication Date
EP4132964A2 true EP4132964A2 (fr) 2023-02-15

Family

ID=75675016

Family Applications (1)

Application Number Title Priority Date Filing Date
EP21721753.8A Withdrawn EP4132964A2 (fr) 2020-04-07 2021-04-06 Leurres protéiques de novo de l'enzyme 2 de conversion de l'angiotensine (ace2)

Country Status (7)

Country Link
EP (1) EP4132964A2 (fr)
JP (1) JP2023521706A (fr)
KR (1) KR20220164736A (fr)
CN (1) CN115515972A (fr)
AU (1) AU2021251760A1 (fr)
CA (1) CA3173628A1 (fr)
WO (1) WO2021207207A2 (fr)

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050282154A1 (en) * 2003-10-06 2005-12-22 The Brigham And Women's Hospital, Inc. Angiotensin-converting enzyme-2 as a receptor for the SARS coronavirus
AR105616A1 (es) 2015-05-07 2017-10-25 Lilly Co Eli Proteínas de fusión
US20200165326A1 (en) 2017-02-24 2020-05-28 Philogen S.P.A. Immunoconjugates with optimized linkers and orientation
WO2018170179A1 (fr) 2017-03-15 2018-09-20 Silverback Therapeutics, Inc. Composés de benzazépine, conjugués et utilisations associées
WO2020018935A2 (fr) * 2018-07-19 2020-01-23 University Of Washington Conception de novo de commutateurs protéiques

Also Published As

Publication number Publication date
JP2023521706A (ja) 2023-05-25
WO2021207207A3 (fr) 2021-11-25
AU2021251760A1 (en) 2022-10-06
CA3173628A1 (fr) 2021-10-14
CN115515972A (zh) 2022-12-23
KR20220164736A (ko) 2022-12-13
WO2021207207A2 (fr) 2021-10-14

Similar Documents

Publication Publication Date Title
WO2021170113A1 (fr) Méthode de traitement de coronavirus à l'aide d'une protéine de fusion ace-2-fc
EP2086998B1 (fr) Sequences d'acides amines se liant a des proteines seriques essentiellement independamment du ph, composes comprenant celles et utilisations correspondantes
US20220235103A1 (en) Novel ankyrin repeat binding proteins and their uses
US20230257726A1 (en) Ace2 compositions and methods
US11466059B2 (en) Modified protein
WO2021170131A1 (fr) Ace2 soluble et protéine de fusion et leurs utilisations
US20230242592A1 (en) Identification of biomimetic viral peptides and uses thereof
US20240115659A1 (en) Viral treatment
TW202227507A (zh) 用於預防或治療病毒及其他微生物感染之組合物
WO2022147290A1 (fr) Anticorps anti-sars-cov2 à neutralisation croisée
JP7019600B2 (ja) 加齢黄斑変性症治療用ペプチド
US9718862B2 (en) Polypeptides and their use in treating and limiting respiratory syncytial virus infection
EP2834263B9 (fr) Peptides immunogeniques epitope-scaffold contre le rsv
EP4132964A2 (fr) Leurres protéiques de novo de l'enzyme 2 de conversion de l'angiotensine (ace2)
US10765754B2 (en) Compositions and methods related to inhibition of respiratory syncytial virus entry
US10766929B2 (en) De novo designed hemagglutinin binding proteins
CN116601291A (zh) 修饰的血管紧张素转换酶2(ace2)及其用途
CN116209672A (zh) Sars-cov-2抑制剂

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: UNKNOWN

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20221103

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20230428