WO2022240743A1 - Procédés pour l'identification et le traitement de formes sévères de covid-19 - Google Patents

Procédés pour l'identification et le traitement de formes sévères de covid-19 Download PDF

Info

Publication number
WO2022240743A1
WO2022240743A1 PCT/US2022/028326 US2022028326W WO2022240743A1 WO 2022240743 A1 WO2022240743 A1 WO 2022240743A1 US 2022028326 W US2022028326 W US 2022028326W WO 2022240743 A1 WO2022240743 A1 WO 2022240743A1
Authority
WO
WIPO (PCT)
Prior art keywords
subject
ms4a4a
gclm
cfap97
arl4c
Prior art date
Application number
PCT/US2022/028326
Other languages
English (en)
Inventor
Seiamak BAHRAM
Thomas W. Chittenden
Raphael Carapito
Original Assignee
Genuity Science, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Genuity Science, Inc. filed Critical Genuity Science, Inc.
Publication of WO2022240743A1 publication Critical patent/WO2022240743A1/fr

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/12Antivirals
    • A61P31/14Antivirals for RNA viruses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/02Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving viable microorganisms
    • C12Q1/18Testing for antimicrobial activity of a material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/70Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage
    • C12Q1/701Specific hybridization probes
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/569Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
    • G01N33/56983Viruses
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/10Gene or protein expression profiling; Expression-ratio estimation or normalisation
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/106Pharmacogenomics, i.e. genetic variability in individual responses to drugs and drug metabolism
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2333/00Assays involving biological materials from specific organisms or of a specific nature
    • G01N2333/005Assays involving biological materials from specific organisms or of a specific nature from viruses
    • G01N2333/08RNA viruses
    • G01N2333/165Coronaviridae, e.g. avian infectious bronchitis virus
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H20/00ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance
    • G16H20/10ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance relating to drugs or medications, e.g. for ensuring correct administration to patients

Definitions

  • ICU intensive care unit
  • RNA Hadjadj et al., 2020
  • plasma Trouillet-Aimpuls et ah, 2020
  • genetic level Zhang et al., 2020.
  • Severity was also shown to be correlated with profound immune dysregulations including modifications in the myeloid compartment with increases in neutrophils (Meizlish et al., 2021; Schulte-Schrepping et al., 2020), decreases in non- classical monocytes (Silvin et al., 2020) and dysregulation of macrophages (Giamarellos-
  • SARS- CoV-2 induces characteristic molecular changes in critical patients that can be used to differentiate them from non-critical patients.
  • the present invention is based, at least in part, on the discovery that certain driver genes may also be responsible for the development of critical illness, and such genes may represent therapeutic targets.
  • the multi- omics approaches disclosed herein included Whole Genome Sequencing (WGS), whole blood RNA-sequencing (RNA-seq), quantitative plasma and Peripheral Blood Mononuclear Cells (PBMC) proteomics, multiplex plasma cytokine profiling and high throughput immune cells phenotyping in conjunction with viral parameters i.e., anti-SARS-Cov-2 neutralizing antibodies and multi-target antiviral serology.
  • WGS Whole Genome Sequencing
  • RNA-seq whole blood RNA-sequencing
  • PBMC Peripheral Blood Mononuclear Cells
  • multiplex plasma cytokine profiling and high throughput immune cells phenotyping in conjunction with viral parameters i.e., anti-SARS-Cov-2 neutralizing antibodies and multi-target antiviral serology.
  • viral parameters i.e., anti-SARS-Cov-2 neutralizing antibodies
  • multi-target antiviral serology Provided herein are unique gene signatures that differentiate critical from non-
  • the up-regulated metalloprotease ADAM9 is identified as a key driver. Inhibition of ADAM9 ex vivo interfered with SARS-Cov-2 uptake and replication in human epithelial cells. In brief, an advanced integrated machine learning and probabilistic programming strategy was applied to identify causal molecular drivers of severe forms of COVID-19 in a small, tightly controlled cohort of patients, the importance of which were then experimentally validated.
  • kits for treating or preventing severe coronavirus disease 2019 (COVID-19) in a subject comprising administering to the subject a composition comprising modulating agents oiADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, orACSSP Modulating agents may decrease or increase the activity or level of the corresponding gene products (e.g ., transcript and/or protein).
  • modulating agents oiADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, orACSSP Modulating agents may decrease or increase the activity or level of the corresponding gene products (e.g ., transcript and/or protein).
  • provided herein are methods of treating and/or preventing severe COVID-19 in a subject. In further aspects, provided herein are methods for predicting the likelihood of a subject infected with SARS-CoV-2 to progressing to severe COVID-19.
  • such methods include (a) sequencing at least part of the subject's genome in a sample from said subject, wherein the at least part of said genome comprises an ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, orACSSl gene; (b) identifying from the sequencing of said sample at least one at least one single-nucleotide polymorphism (SNP in one or more of genes: ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, orACSSP, and (c) administering a corresponding modulating agent that decreases or increases the expression or activity of the gene products of one or more of ADAM9, MCEMP1, MS4A4A, RAB10,
  • the method comprises (a) sequencing at least part of the subject's genome in a sample from said subject, wherein the at least part of said genome comprises an ADAM9 gene; (b) identifying from the sequencing of said sample at least one single-nucleotide polymorphism (SNP) in ADAM9; and (c) administering a corresponding inhibitor of the ADAM9 gene or its activity.
  • SNP single-nucleotide polymorphism
  • said methods comprise (a) sequencing and/or measuring (e.g, qPCR, digital PCR) at least part of the subject's transcriptome in a sample from said subject, wherein the at least part of said transcriptome comprises at least one mRNA of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1 genes; (b) determining the expression level of at least one of ADAM9, MCEMP1, MS4A4A, RABIO, GCLM, EPHX2, RORA, CFAP97, ARL4C, orACSSl of step (a) and comparing it to a reference value, wherein the expression level of at least one of ADAM9, MCEMP1, MS4A4A, RABIO, GCLM, EPHX2,
  • said methods comprise (a) sequencing at least part of the subject's transcriptome in a sample from said subject, wherein the at least part of said transcriptome comprises the mRNA of ADAM9 ; (b) determining the expression level of the ADAM9 gene at the mRNA or protein level and comparing it to a reference value, wherein the expression level of the ADAM9 gene relative to the reference value indicates whether the subject will respond to an inhibitor of the ADAM9 expression or activity; and (c) administering said modulating agent of ADAM9, MCEMPl, MS4A4A, RABIO, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1 expression or activity.
  • provided herein are methods for monitoring a human subject suffering from CoVID-19 for potential treatment with a modulating agent that decreases or increases the expression or activity of the gene products of one or more of ADAM9,
  • the methods comprise a) obtaining a gene expression profile from the sample, wherein the expression profile comprises expression levels for one or more genes; wherein said one or more genes comprise at least ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or A ( "SSI ; and b) comparing the gene expression profile of each sample chronologically, wherein an increase in one or more of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, orACSSl expression overtime identifies the subject as a critical subject; and c) administering to the subject the corresponding modulating agent or combination of modulating agents.
  • the methods comprise a) obtaining a gene expression profile from the sample, wherein the expression profile comprises expression levels for ADAM9; and b) comparing the gene expression profile of each sample chronologically, wherein an increase in ADAM9 expression over time identifies the subject as a critical subject; and c) administering to the subject an ADAM9 inhibitor.
  • the methods comprise (a) sequencing or genotyping of at least part of the subject's genome in a sample from said subject, wherein the at least part of said genome comprises one or more of an ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C or ACSS1 gene; (b) identifying from the sequencing or genotyping of said sample at least one SNP in one or more of genes ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1; and (c) using individual SNPs to form individual SNP risk scores or to combine multiple SNPs to define polygenic risk scores to provide an indication of the likelihood of progression to severe COVID-19.
  • kits for predicting the likelihood of a subject infected with SARS-CoV-2 to progressing to severe COVID-19 comprise: (a) sequencing or genotyping at least part of the subject's genome in a sample from said subject, wherein the at least part of said genome comprises one or more of an ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C or
  • ACSS1 gene (b) identifying from the sequencing or genotyping of said sample at least one
  • the methods comprise: (a) sequencing or other measurement or measuring ( e.g .
  • qPCR, digital PCR of at least part of the subject's transcriptome in a sample from said subject, wherein the at least part of said transcriptome comprises at least one mRNA of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1 genes; (b) determining the expression level of at least one of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1 of step (a); (c) forming from said expression level a feature vector; and (d) providing the feature vector to a trained classifier and receiving therefrom an indication of the likliohood of progression to severe COVID-19.
  • methods for predicting the likelihood of a subject infected with SARS-CoV-2 progressing to severe COVID-19 comprising one or more of following steps: (a) measuring the level of soluble ADAM9 protein in a sample from the subject; (b) measuring the expression level of ADAM9 at the RNA level in a sample from the subject; and/or (c) measuring the expression level of ADAM9 at the protein level in a sample from the subject.
  • measuring the expression level of the ADAM9 gene comprises one or more of: (a) measuring the level of soluble ADAM9 protein; (b) measuring the expression level of ADAM9 at the RNA level; or (c) measuring the expression level of ADAM9 at the protein level; wherein when the level of ADAM9 expression exceeds a threshold limit the subject is administered an ADAM9 inhibitor; and wherein when the level of ADAM9 expression does not exceed said threshold limit the subject is not administered an ADAM9 inhibitor.
  • the disclosed methods of treating severe COVID-19 may include (a) bringing a biological sample into contact with an antibody immobilized on a solid support, wherein said antibody specifically binds an ADAM9- induced peptide cleavage product; (b) incubating the biological sample in contact with the immobilized antibody under conditions such that a cleavage product-antibody complex is formed when the cleaved peptide is present in the biological sample; (c) contacting said cleavage product-antibody complex with a reporter group-conjugated anti-immunoglobulin; (d) incubating the cleavage product- antibody complex in contact with the reporter group-conjugated anti-immunoglobulin under conditions such that a cleavage product-antibody-reporter group-conjugated anti immunoglobulin complex is formed when the cleaved peptide is present in the biological sample; (e) adding substrate to the cleavage product-antibody-re
  • the product or the change in the substrate measured is proportional to the amount of ADAM9- induced peptide cleavage product in the biological sample.
  • the subject when the level of ADAM9- induced peptide cleavage product exceeds a threshold limit the subject is administered an ADAM9 inhibitor.
  • the level of ADAM9- induced peptide cleavage product does not exceed said threshold limit the subject is not administered an ADAM9 inhibitor.
  • the method comprises (a) sequencing of at least part of the subject's transcriptome in a sample from said subject, wherein the at least part of said transcriptome comprises at least the 600 genes in the genomic signature disclosed herein; (b) determining the expression levels of the at least the 600 genes in the genomic signature disclosed herein; (c) forming from said expression levels a feature vector; and (d) providing the feature vector to a trained classifier and receiving therefrom an indication of the likelihood of progression to severe COVID-19.
  • the methods comprise (a) sequencing of at least part of the subject's transcriptome in a sample from said subject, wherein the at least part of said transcriptome comprises at least the 600 genes in the genomic signature disclosed herein; (b) determining the expression levels of the at least the 600 genes in the genomic signature disclosed herein; (c) forming from said expression levels a feature vector; and (d) providing the feature vector to a trained classifier and receiving therefrom an indication of the likelihood of progression to severe ARDS.
  • in vitro diagnostic kits for the analysis and/or detection of driver and/or dowstream genes such as (without limitation) one or more of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1.
  • the in vitro diagnostic kits provided herein are for the analysis of at least part of a subjects genome, e.g ., for the detection and identification of single-nucleotide polymorphisms (SNPs) in one or more driver and/or dowstream genes disclosed herein.
  • SNPs single-nucleotide polymorphisms
  • the in vitro diagnostic kits provided herein are for the detection and/or analysis of the expression level (e.g, transcript or protein level) of one or more driver and/or dowstream genes disclosed herein.
  • the in vitro diagnostic kits contemplated herein are for the detection of protein, such as soluble ADAM9 protein.
  • the in vitro diagnostic kits provided herein are for the detection and/or analysis of the activity of the gene product of one or more driver and/or dowstream genes disclosed herein, e.g., detection and analysis of the proteolytic activity of ADAM9 protein.
  • Figure 1 shows the global multi-omics analysis strategy to identify pathways and drivers of ARDS.
  • C Critical patients
  • NC Non-critical patients
  • H Healthy Controls
  • PBMC peripheral blood mononuclear cells
  • Plasma was used for cytokine profiling (ELISA for IL-17, V-PLEX Proinflammatory Panel and S-PLEX Human IFN-a2a Kit, Mesoscale Discovery) and whole proteomics.
  • RNA-seq PaxGene tubes, PreAnalytiX
  • WGS Whole Genome Sequencing
  • RNA-seq pipeline based on NC vs.
  • C RNA-seq pipeline based on NC vs.
  • RNA-seq data was split 100 times with 80% for training and the rest for testing.
  • feature selection was done based on differential expression; the genes that were significantly differentially expressed for each partition of training data were selected for both the training and corresponding test data.
  • Classification was performed with an ensemble computational approach using 7 different algorithms. After classification and verifying the quality of the results on the test dataset, an ensemble feature ranking score across 6 of the 7 algorithms and all 100 partitions of the data was determined. The top 600 of those features was used as the input for structural causal modeling to derive a putative causal network.
  • C RNA-seq pipeline based on NC vs.
  • Cytokines and immune cells were quantified following the manufacturer’s instructions.
  • WGS data was used for eQTL analysis together with the gene counts from RNA-seq.
  • proteomics data were subjected to differential protein expression and nGOseq enrichment analyses.
  • D The key pathways and drivers resulting from the omics analyses (B and C) were validated in a replication cohort of 81 critical and 73 recovered critical patients.
  • the differential expression of ADAM9, the main driver gene, was compared to publicly available bulk RNA-seq data.
  • in vitro infection experiments with SARS-CoV-2 were conducted to validate a driver gene candidate.
  • Figure 2 shows immune profiling of healthy individuals, non-critical and critical COVID-19 patients:
  • A. Pro-inflammatory cytokines were quantified in plasma by using cytokine profiling assays (V-PLEX Proinflammatory Panel and S-PLEX Human IFN-a2a Kit, Mesoscale Discovery) or ELISA (IL-17, R&D Systems).
  • B. Absolute Lymphocyte counts. Each dot represents a single patient.
  • D-G. Proportions of modified lymphocyte subsets from COVID-19 patients and healthy controls as determined by mass cytometry.
  • T-cell subsets D
  • B-cell subsets E
  • Dendritic cells F
  • Non-classical monocytes G
  • the other cell subsets are presented in Figure 4.
  • Each dot represents a single patient.
  • P-values were determined with the Kruskal -Wallis test, followed by Dunn’s post-test for multiple group comparison; *P ⁇ 0.05, ** P ⁇ 0.01, *** P ⁇ 0.001, **** p ⁇ 0.0001.
  • B the P-value is determined from a two- tailed unpaired t test; * P ⁇ 0.05, ** P ⁇ 0.01, *** P ⁇ 0.001, **** P ⁇ 0.0001.
  • Figure 3 shows Type I interferon response.
  • ISG Interferon Stimulated Genes
  • A Interferon Stimulated Genes (ISG) scores based on mean normalized expression of six genes (IFI44L, IFI27, RSAD2, SIGLEC1, IFIT1, ISG15) in RNA-seq data.
  • B Heatmap showing expression of type I IFN-related genes in RNA-seq data. Up-regulated proteins are shown in red and down-regulated proteins are shown in light blue.
  • C IFNa2a (pg/ml) concentration evaluated by ultra-sensitive S-PLEX Human IFNa2a Kit (Mesoscale Discovery).
  • D Time-dependent IFNa2a concentration in the critical group.
  • E Quantification of plasmacytoid dendritic cells as a percentage of PBMCs.
  • Figure 4 shows immune profiling in healthy individuals, non-critical and critical COVID-19 patients by mass cytometry. Proportions of modified lymphocyte subsets from COVID-19 patients and healthy controls as determined by mass cytometry: proportions of dendritic cells subsets (A), monocytes subsets (B), NK cells subsets (C), NKT (D), gd T-cells (E) and granulocyte subsets (traces) including neutrophils (F) are shown. Each dot represents a single patient.
  • Figure 5 shows plasma and PBMC proteomics of healthy individuals, non-critical and critical COVID-19 patients.
  • A Total number of proteins identified in plasma of patients and healthy controls. Each dot represents a patient.
  • B Multidimensional scaling plot of normalized intensities of all patients/individuals of the three groups.
  • C Volcano-plot representing the differentially expressed proteins (DEPs) in Critical versus Non-critical patients. The orange dots represent the proteins that are differentially expressed with a corrected P-value ⁇ 0.05. Proteins labelled in green and purple represent down-regulated apolipoproteins and up-regulated acute phase proteins, respectively.
  • D Normalized intensities of the proteins S100A8 and S100A9 in the three groups.
  • E Heatmap showing the expression of apolipoproteins involved in macrophage functions and acute phase proteins in the three groups. Up-regulated proteins are shown in red and down-regulated proteins are shown in light blue.
  • F Total number of proteins identified in PBMC of patients and healthy controls. Each dot represents a patient.
  • G Multidimensional scaling plot of normalized intensities of all patients/individuals of the three groups.
  • H Volcano-plot representing the DEPs in Critical versus Non-critical patients.
  • the orange dots represent the proteins that are differentially expressed with a corrected P-value ⁇ 0.05.
  • Proteins labelled in green and purple represent up- regulated proteins involved in regulation of blood coagulation and myeloid cell differentiation, respectively.
  • I Heatmap showing the expression of proteins involved in regulation of blood coagulation and myeloid cell differentiation in the three groups. Up- regulated proteins are shown in red and down-regulated proteins are shown in light blue.
  • Figure 6 shows RNA-seq and combined omics analysis of critical patient’s specific pathways.
  • the orange dots represent the genes that are differentially expressed with a corrected P-value ⁇ 0.05.
  • Proteins labeled in green and purple represent up-regulated genes involved in blood pressure regulation and viral entry, respectively.
  • B Gene set enrichment analysis plots showing positive enrichment of inflammatory response, myeloid leukocyte activation and neutrophil degranulation pathways. NES, normalized enrichment score.
  • C Enriched nested gene ontology (nGO) categories in critical vs. non-critical patients in RNA-seq, plasma proteomics and PBMC proteomics.
  • Figure 7 shows integrated AI/ML and probabilistic programming of non-critical and critical COVID-19 patients.
  • A ROCs on the train and test set for Critical vs Non-critical groups comparison. All methods perform similarly. Other classification metrics are given in Table 4.
  • B Putative network showing flow of causal information based on top 600 most informative genes for classifying RNA-seq data of Critical versus Non-critical patients.
  • C Box plots showing the normalized gene counts of the five driver genes in critical and non- critical patients. The indicated values correspond to the FDR.
  • Figure 8 shows results of in silico perturbation experiments.
  • Left change in BIC (Bayesian Information Criterion) when perturbing each gene individually. Genes are ordered by the change in the number of ancestors minus the number of descendants for the DAG shown in Figure 7B; /. e. , the top 5 driver genes are the 5 leftmost points, and the top 5 response genes are the 5 rightmost points.
  • Right Change in the BIC of a random sample of 5 genes from the left. The mean BIC of the top 5 driver genes is shown in red.
  • Figure 9 shows validation of the RNA-seqsignature-based classification performance of critical and recovered critical COVID-19 patients.
  • A. ROCs on the train and test set for Critical vs Recovered Critical groups comparison in the replication cohort with the 600 gene signature identified from the initial cohort. All methods perform similarly.
  • B. Classification metrics.
  • Figure 10 shows validation of ADAM9 as a key driver for viral infection and replication.
  • A Quantitative RT-PCR confirmation of differential expression of ADAM9 non- critical vs. critical patients.
  • B Soluble ADAM9 (sADAM9) concentration in plasma of healthy, non-critical and critical patients determined by ELISA.
  • C Soluble MICA concentration (sMICA) in serum of healthy, non-critical and critical patients determined by ELISA.
  • D Expression of ADAM9 according to the genotype of the eQTL rs7840270.
  • E Experimental approach to assess the viral up-take and the viral replication in silenced Vero- E6 or A549-ACE2 cells.
  • F F.
  • Figure 12 shows validation of ADAM9 silencing.
  • A Quantitative RT-PCR of the ADAM9 transcript in Vero-E6 or A549-ACE2 cells silenced with a control siRNA or an ADAM9-specific siRNA. The average silencing achieved is 66% and 93% for Vero-E6 and A549-ACE2, respectively (mean of 3 representative experiments).
  • B Western blot of Vero- E6 and A549-ACE2 cells that have not been transfected (NT), silenced with a control siRNA (ctl) or with an ADAM9-specific siRNA (sik).
  • an element means one element or more than one element.
  • administering means providing a pharmaceutical agent or composition to a subject, and includes, but is not limited to, administering by a medical professional and self-administering.
  • amino acicT is intended to embrace all molecules, whether natural or synthetic, which include both an amino functionality and an acid functionality and capable of being included in a polymer of naturally-occurring amino acids.
  • exemplary amino acids include naturally-occurring amino acids; analogs, derivatives and congeners thereof; amino acid analogs having variant side chains; and all stereoisomers of any of the foregoing.
  • the term “ antibody ” may refer to both an intact antibody and an antigen binding fragment thereof.
  • Intact antibodies are glycoproteins that include at least two heavy (H) chains and two light (L) chains inter-connected by disulfide bonds.
  • Each heavy chain includes a heavy chain variable region (abbreviated herein as VET) and a heavy chain constant region.
  • Each light chain includes a light chain variable region (abbreviated herein as VL) and a light chain constant region.
  • VH and VL regions can be further subdivided into regions of hypervariability, termed complementarity determining regions (CDR), interspersed with regions that are more conserved, termed framework regions (FR).
  • CDR complementarity determining regions
  • FR framework regions
  • the variable regions of the heavy and light chains contain a binding domain that interacts with an antigen.
  • the constant regions of the antibodies may mediate the binding of the immunoglobulin to host tissues or factors, including various cells of the immune system (e.g ., effector cells) and the first component (Clq) of the classical complement system.
  • the term “antibody” includes, for example, monoclonal antibodies, polyclonal antibodies, chimeric antibodies, humanized antibodies, human antibodies, multispecific antibodies (e.g., bispecific antibodies), single chain antibodies and antigen-binding antibody fragments.
  • antigen binding site refers to a region of an antibody or T cell that specifically binds the epitope(s) of an antigen.
  • binding r or “ interacting’ ’ refers to an association, which may be a stable association, between two molecules, e.g, between a peptide and a binding partner or agent, e.g, small molecule, due to, for example, electrostatic, hydrophobic, ionic and/or hydrogen- bond interactions under physiological conditions.
  • tissue sample includes a tissue sample or a bodily fluid sample.
  • a tissue sample includes, but is not limited to, buccal cells, a brain sample, a skin sample, or an organ sample (e.g, liver).
  • a bodily fluid sample includes all fluids that are present in the body including, but not limited to, blood, plasma, serum, saliva, synovial fluid, lymph, urine, or cerebrospinal fluid.
  • the sample may also be obtained by subjecting it to a pre-treatment step, if necessary, e.g, by homogenizing the sample or by extracting or isolating a component of the sample. Suitable pre-treatment steps may be selected by one skilled in the art depending on nature of the biological sample.
  • samples such as serum samples can be diluted prior to analysis.
  • the source of the tissue sample may be solid tissue, as from a fresh, frozen and/or preserved organ, tissue sample, biopsy, or aspirate; blood or any blood constituents, serum, blood; bodily fluids such as cerebral spinal fluid, amniotic fluid, peritoneal fluid or interstitial fluid, urine, saliva, stool, tears; or cells from any time in gestation or development of the subject.
  • Gene construct may refer to a nucleic acid, such as a vector, plasmid, viral genome or the like which includes a “coding sequence” for a polypeptide or which is otherwise transcribable to a biologically active RNA (e.g, antisense, decoy, ribozyme, etc.), may be transfected into cells, e.g, mammalian cells, and may cause expression of the coding sequence in cells transfected with the construct.
  • the gene construct may include one or more regulatory elements operably linked to the coding sequence, as well as intronic sequences, polyadenylation sites, origins of replication, marker genes, etc.
  • operably linked to refers to the functional relationship of a nucleic acid with another nucleic acid sequence. Promoters, enhancers, transcriptional and translational stop sites, and other signal sequences are examples of nucleic acid sequences operably linked to other sequences.
  • operable linkage of DNA to a transcriptional control element refers to the physical and functional relationship between the DNA and promoter such that the transcription of such DNA is initiated from the promoter by an RNA polymerase that specifically recognizes, binds to and transcribes the DNA.
  • polynucleotide and “ nucleic acid ’ are used interchangeably. They refer to a natural or synthetic molecule, or some combination thereof, comprising a single nucleotide or two or more nucleotides linked by a phosphate group at the 3’ position of one nucleotide to the 5’ end of another nucleotide.
  • the polymeric form of nucleotides is not limited by length and can comprise either deoxyribonucleotides or ribonucleotides, or analogs thereof.
  • Polynucleotides may have any three-dimensional structure, and may perform any function.
  • polynucleotides coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers.
  • a polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs.
  • modifications to the nucleotide structure may be imparted before or after assembly of the polymer.
  • a polynucleotide may be further modified, such as by conjugation with a labeling component.
  • U nucleotides are interchangeable with T nucleotides.
  • the polynucleotide is not necessarily associated with the cell in which the nucleic acid is found in nature, and/or operably linked to a polynucleotide to which it is linked in nature.
  • protein protein
  • peptide polypeptide
  • polypeptide fragment may be used interchangeably herein to refer to polymers of amino acid, in certain embodiments prepared from recombinant DNA or RNA, or of synthetic origin, or some combination thereof, which (1) is not associated with proteins that it is normally found with in nature, (2) is isolated from the cell in which it normally occurs, (3) is isolated free of other proteins from the same cellular source, (4) is expressed by a cell from a different species, or (5) does not occur in nature.
  • polypeptide fragment when used in reference to a particular polypeptide, refers to a polypeptide in which amino acid residues are deleted as compared to the reference polypeptide itself, but where the remaining amino acid sequence is usually identical to that of the reference polypeptide. Such deletions may occur at the amino- terminus or carboxy-terminus of the reference polypeptide, or alternatively both. Fragments typically are at least about 5, 6, 8 or 10 amino acids long, at least about 14 amino acids long, at least about 20, 30, 40 or 50 amino acids long, at least about 75 amino acids long, or at least about 100, 150, 200, 300, 500 or more amino acids long. A fragment can retain one or more of the biological activities of the reference polypeptide.
  • a fragment may comprise an enzymatic activity and/or an interaction site of the reference polypeptide.
  • a fragment may have immunogenic properties.
  • specific binding refers to the ability of an antibody to bind to a predetermined antigen or the ability of a peptide to bind to its predetermined binding partner.
  • an antibody or peptide specifically binds to its predetermined antigen or binding partner with an affinity corresponding to a KD of about 10-7 M or less, and binds to the predetermined antigen/binding partner with an affinity (as expressed by KD) that is at least 10 fold less, at least 100 fold less or at least 1000 fold less than its affinity for binding to a non-specific and unrelated antigen/binding partner (e.g, BSA, casein).
  • a non-specific and unrelated antigen/binding partner e.g, BSA, casein
  • telomere binding reaction when referring to a polypeptide (including antibodies) or receptor, may refer to a binding reaction which is determinative of the presence of the protein or polypeptide or receptor in a heterogeneous population of proteins and other biologies; or to a binding reaction that results in blocking and/or inhibiting the expression and/or activity of a target gene.
  • a specified ligand or antibody “specifically binds” to its particular “target” (e.g, an antibody specifically binds to an antigen) when it does not bind in a significant amount to other proteins present in the sample or to other proteins to which the ligand or antibody may come in contact in an organism.
  • target e.g, an antibody specifically binds to an antigen
  • a first molecule that “specifically binds” a second molecule has an affinity constant (Ka) greater than about 10 5 M -1 (e.g, 10 6 M -1 , 10 7 M -1 , 10 8 M -1 , 10 9 M -1 , 10 10 M -1 , 10 11 M -1 , and 10 12 M -1 or more) with that second molecule.
  • Ka affinity constant
  • subject means a human or non-human animal selected for treatment or therapy.
  • transformation means the introduction of a nucleic acid, e. ., an expression vector, into a recipient cell (e.g, a mammalian cell) including introduction of a nucleic acid to the chromosomal DNA of said cell.
  • a recipient cell e.g, a mammalian cell
  • immunogenic or antigenic polypeptide includes polypeptides that are immunologically active in the sense that once administered to the host or a sample from said host, it is able to evoke an immune response of the humoral and/or cellular type directed against the protein (e.g, the binding of antibodies to the antigenic peptide, such as neutralizing antibodis).
  • An “immunogenic” protein or polypeptide, as used herein, includes the full-length sequence of the protein, analogs thereof, or immunogenic fragments thereof.
  • immunogenic fragment is meant a fragment of a protein which includes one or more epitopes and thus elicits the immunological response described above.
  • the invention encompasses active fragments and variants of the antigenic polypeptide.
  • the protein fragment is such that it has substantially the same immunological activity as the total protein.
  • a protein fragment according to the invention comprises or consists essentially of or consists of at least one epitope or antigenic determinant.
  • immunological or antigenic peptide/ polypeptide further contemplates deletions, additions and substitutions to the sequence, so long as the polypeptide functions to produce an immunological response as defined herein.
  • Such includes amino acid or peptide sequence having conservative amino acid substitutions, non conservative amino acid substitutions (e.g, a degenerate variant), substitutions within the wobble position of each codon (e.g, DNA and RNA) encoding an amino acid, amino acids added to the C-terminus of a peptide, or a peptide having 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% sequence identity to a reference sequence.
  • conservative amino acid substitutions e.g, a degenerate variant
  • substitutions within the wobble position of each codon e.g, DNA and RNA
  • vector refers to the means by which a nucleic acid can be propagated and/or transferred between organisms, cells, or cellular components.
  • Vectors include plasmids, viruses, bacteriophage, pro-viruses, phagemids, transposons, and artificial chromosomes, and the like, to which the nucleic acid has been linked, and may or may not be able to replicate autonomously or integrate into a chromosome of a host cell.
  • Such vectors may include any vector, (e.g, a plasmid, cosmid or phage chromosome) containing a gene construct in a form suitable for expression by a cell (e.g., linked to a transcriptional control element).
  • kits for treating or preventing severe coronavirus disease 2019 (COVID-19) in a subject comprising administering to the subject a composition comprising a modulating agent of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, ACSS1, or any combination thereof.
  • the modulating agents contemplated and disclosed herein may decrease or increase the activity or level of the corresponding gene products (e.g, transcript and/or protein).
  • the compositions disclosed herein comprise at least an inhibitor of ADAM9.
  • provided herein are methods of treating and/or preventing severe COVID-19 in a subject. In further aspects, provided herein are methods for predicting the likelihood of a subject infected with SARS-CoV-2 progressing to severe COVID-19.
  • such methods include (a) sequencing at least part of the subject's genome in a sample from said subject, wherein the at least part of said genome comprises an ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, orACSSl gene; (b) identifying from the sequencing of said sample at least one at least one single-nucleotide polymorphism (SNP) in one or more of genes: ADAM9,
  • MCEMP1, MS4A4A, RAB10 GCLM, EPHX2, RORA, CFAP97, ARL4C, orACSSl; and (c) administering a corresponding modulating agent that decreases or increases the expression or activity of the gene products of one or more of ADAM9, MCEMP1, MS4A4A, RAB10,
  • the method comprises (a) sequencing at least part of the subject's genome in a sample from said subject, wherein the at least part of said genome comprises an ADAM9 gene; (b) identifying from the sequencing of said sample at least one single-nucleotide polymorphism (SNP) in ADAM9; and (c) administering a corresponding inhibitor of the ADAM9 gene or its activity.
  • ADAM9 single-nucleotide polymorphism
  • the consequence of the at least one SNP is a frameshift mutation, nonsense mutation, missense mutation, or splice-site variant mutation.
  • the at least one SNP is located in a non-coding region of the gene and/or corresponding mRNA transcript.
  • the consequence of the at least one SNP is a 5' UTR variant, a 3' UTR variant, or an intron variant.
  • such SNPs include rs7840270, rs7831735, rsl 1465401, rsl 1465397, rsl89755275, rs76847438, rsl0736707, and rsl0792287.
  • the SNPs of interest are rs7840270 and/or rs7831735.
  • disclosed herein are methods of treating and/or preventing severe COVID-19 in a subject.
  • methods for predicting the likelihood of a subject infected with SARS-CoV-2 progressing to severe COVID-19 i.e., a critical COVID-19 subject.
  • said methods comprise (a) sequencing at least part of the subject's transcriptome in a sample from said subject, wherein the at least part of said transcriptome comprises at least one mRNA of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, orACSSl; (b) determining the expression level of at least one of ADAM9, MCEMPl, MS4A4A, RABIO, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1 of step (a) and comparing it to a reference value, wherein the expression level of at least one of ADAM9, MCEMPl, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1 gene relative to the reference value indicates whether the subject will respond to a corresponding modulating agent that decreases or increases the expression or activity of the gene products of ADAM9
  • said methods comprise (a) sequencing at least part of the subject's transcriptome in a sample from said subject, wherein the at least part of said transcriptome comprises the mRNA of ADAM9 ; (b) determining the expression level of the ADAM9 gene at the mRNA or protein level and comparing it to a reference value, wherein the expression level of the ADAM9 gene relative to the reference value indicates whether the subject will respond to an inhibitor of the ADAM9 expression or activity; and (c) administering said inhibitor oiADAM9 to the subject.
  • the expression level reference value is derived from a sample from a non-critical subject suffering from COVID-19 or is indicative of a non-critical subject suffering from COVID-19.
  • the expression level reference value is derived from a sample from an asymptomatic subject infected with SARS-CoV-2 or is indicative of an asymptomatic subject infected with SARS-CoV-2.
  • the expression level reference value is derived from a sample from a healthy subject or is indicative of a healthy subject.
  • provided herein are methods for monitoring a human subject suffering from CoVID-19 for potential treatment with a modulating agent that decreases or increases the expression or activity of the gene products of one or more of ADAM9,
  • the methods comprise a) obtaining a gene expression profile from the sample, wherein the expression profile comprises expression levels for one or more genes; wherein said one or more genes comprise one or more of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSSP, and b) comparing the gene expression profile of each sample chronologically, wherein an increase in one or more of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1 expression over time identifies the subject as a critical subject; and c) administering to the subject the corresponding modulating agent or combination of modulating agents.
  • the methods comprise a) obtaining a gene expression profile from the sample, wherein the expression profile comprises expression levels for ADAM9; and b) comparing the gene expression profile of each sample chronologically, wherein an increase in ADAM9 expression over time identifies the subject as a critical subject; and c) administering to the subject an ADAM9 inhibitor.
  • the methods comprise (a) sequencing or genotyping of at least part of the subject's genome in a sample from said subject, wherein the at least part of said genome comprises one or more of an ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C or ACSS1 gene; (b) identifying from the sequencing or genotyping of said sample at least one SNP in one or more of genes ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1; and (c) using individual SNPs to form individual SNP risk scores or to combine multiple SNPs to define polygenic risk scores to provide an indication of the likelihood of progression to severe COVID-19.
  • the methods comprise: (a) sequencing or genotyping at least part of the subject's genome in a sample from said subject, wherein the at least part of said genome comprises one or more of an ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C or ACSS1 gene; (b) identifying from the sequencing or genotyping of said sample at least one SNP in one or more of genes ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1; (c) forming from said at least one SNP a feature vector; and (d) providing the feature vector to a trained classifier and receiving therefrom an indication of the likliohood of progression to severe COVID-19.
  • the methods comprise: (a) sequencing or other measurement or measuring (e.g . qPCR, digital PCR) of at least part of the subject's transcriptome in a sample from said subject, wherein the at least part of said transcriptome comprises at least one mRNA of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1 genes; (b) determining the expression level of at least one of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1 of step (a); (c) forming from said expression level a feature vector; and (d) providing the feature vector to a trained classifier and receiving therefrom an indication of the liklio
  • the trained classifier comprises a LASSO model, a ridge regression model, a support vector machine (SVM), a quantum support vector machine (qSVM), an XGBoost model (XGB) a random forest (RF), or a DANN artificial neural network.
  • SVM support vector machine
  • qSVM quantum support vector machine
  • XGBoost XGBoost model
  • RF random forest
  • DANN DANN artificial neural network
  • methods for predicting the likelihood of a subject infected with SARS-CoV-2 progressing to severe COVID-19 comprising one or more of following steps: (a) measuring the level of soluble ADAM9 protein in a sample from the subject; (b) measuring the expression level of ADAM9 at the RNA level in a sample from the subject; and/or (c) measuring the expression level of ADAM9 at the protein level in a sample from the subject.
  • provided herein are methods for treating or preventing severe COVID-19 in a subject, comprising measuring in a sample from the subject the expression level of the ADAM9 gene. In some embodiments, measuring the expression level of the
  • ADAM9 gene comprises one or more of: (a) measuring the level of soluble ADAM9 protein; (b) measuring the expression level of ADAM9 at the RNA level; or (c) measuring the expression level of ADAM9 at the protein level; wherein when the level of ADAM9 expression exceeds a threshold limit the subject is administered an ADAM9 inhibitor; and wherein when the level of ADAM9 expression does not exceed said threshold limit the subject is not administered an ADAM9 inhibitor.
  • the disclosed methods of treating severe COVID-19 may include (a) bringing a biological sample into contact with an antibody immobilized on a solid support, wherein said antibody specifically binds an ADAM9-induced peptide cleavage product; (b) incubating the biological sample in contact with the immobilized antibody under conditions such that a cleavage product-antibody complex is formed when the cleaved peptide is present in the biological sample; (c) contacting said cleavage product-antibody complex with a reporter group-conjugated anti-immunoglobulin; (d) incubating the cleavage product- antibody complex in contact with the reporter group-conjugated anti-immunoglobulin under conditions such that a cleavage product-antibody-reporter group-conjugated anti immunoglobulin complex is formed when the cleaved peptide is present in the biological sample; (e) bringing a biological sample into contact with an antibody immobilized on a solid support, wherein said
  • the product or the change in the substrate measured is proportional to the amount of ADAM9-induced peptide cleavage product in the biological sample.
  • the subject when the level of ADAM9-induced peptide cleavage product exceeds a threshold limit the subject is administered an ADAM9 inhibitor.
  • the level of ADAM9- induced peptide cleavage product does not exceed said threshold limit the subject is not administered an ADAM9 inhibitor.
  • the method comprises (a) sequencing of at least part of the subject's transcriptome in a sample from said subject, wherein the at least part of said transcriptome comprises at least the 600 genes in the genomic signature disclosed herein; (b) determining the expression levels of the at least the 600 genes in the genomic signature disclosed herein; (c) forming from said expression levels a feature vector; and (d) providing the feature vector to a trained classifier and receiving therefrom an indication of the likelihood of progression to severe COVID-19.
  • the methods comprise (a) sequencing of at least part of the subject's transcriptome in a sample from said subject, wherein the at least part of said transcriptome comprises at least the 600 genes in the genomic signature disclosed herein; (b) determining the expression levels of the at least the 600 genes in the genomic signature disclosed herein; (c) forming from said expression levels a feature vector; and (d) providing the feature vector to a trained classifier and receiving therefrom an indication of the likelihood of progression to severe ARDS.
  • ARDS also typically occurs in people who are already critically ill or who have significant injuries.
  • the signs and symptoms of ARDS can vary in intensity and can include, Severe shortness of breath, labored and unusually rapid breathing, low blood pressure, confusion and extreme tiredness.
  • the underlying causes of ARDS may include sepsis; damage to the tissues of the lungs such as by inhalation fo harmful substances (e.g ., high concentrations of smoke, chemical fumes/inhalants, as well as damage caused by aspiration, such as the aspiration of vomit or as a result near-drowning; severe pneumonia, physical traumatic such as to the head, chest, or other major injury (e.g., damage caused by falls, car crashes, gunshot wounds, and the like); pancreatitis; severe bum injury; massive blood transfusion.
  • the subject is suffereing from a viral infection.
  • the subject is suffering from a non-viral infection or inflammation.
  • the subject is suffering from traumatic injury.
  • the sample is a tissue sample or a bodily fluid sample.
  • the sample is a blood sample.
  • the sample comprises serum or sera derived from the subject.
  • agents e.g ., activators and/or inhibitors
  • a target gene e.g., the level of transcript or active protein
  • such agents include modulating agents of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, and/or ACSS1.
  • the modulating agent is a chemical compound, a small molecule, a mixture of chemical compounds and/or a biological macromolecule (such as a nucleic acid, an antibody, an antibody fragment, a protein or a peptide).
  • the agents contemplated herein include those disclosed herein, those known in the art, and those that may be identified by screening or validation assays disclosed herein.
  • the modulating agent is an inhibitor.
  • the agent is an inhibitor of ADAM9.
  • Small molecule inhibitors known in the art include Batimastat, Marimastat, and CGS27023.
  • the modulating agent is an antibody or antibody fragment that binds specifically to the protein expressed by the target gene.
  • the antibody depletes, neutralizes, or inhibits one or more associated activities of said protein.
  • Such antibodies include, but are not limited to, RAV-18, KID-24, and fragments thereof.
  • the antibody may induce/ activate or enhance one or more associated activities of said protein, such as anti-CD79b and the like.
  • the inhibitor is an interfering nucleic acid specific for an mRNA product of a target gene disclosed herein.
  • interfering nucleic acids are known in the art and include, without limitation, siRNAs, shRNAs, miRNAs, peptide nucleic acids (PNAs), and the like, as are known in the art.
  • the interfering nucleic acid is a siRNA, such as HSS112867 (Thermofisher Scientific, US).
  • a combination of modulating agents may be administered to the subject in need thereof.
  • the combination and administration of such modulating agents is informed, at least in part, by the methods disclosed herein.
  • the combination of modulating agents may be of inhibitors or activators of a plurality of different genes, multiple inhibitors or activators of the same gene, or combinations of such inhibitors and activators.
  • the combination of modulatory agents can be administered either in the same formulation or in separate formulations, either concomitantly or sequentially.
  • a subject who receives such personalized treatment can benefit from a combined effect of different therapeutic agents.
  • kits for use in performing any of the methods disclosed herein are kits for use in performing any of the methods disclosed herein.
  • kits as are contemplated herein include, in sufficient for at least one assay, a composition comprising a coronavirus antigen of the current invention as a separately packaged reagent. Instructions for use of the packaged reagent are also typically included. “Instructions for use” typically include a tangible expression describing the reagent concentration or at least one assay method parameter such as the relative amounts of reagent and sample to be admixed, maintenance time periods for reagent/sample admixtures, temperature, buffer conditions and the like.
  • in vitro diagnostic kits for the analysis and/or detection of driver genes such as (without limitation) one or more of ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C, or ACSS1.
  • the in vitro diagnostic kits provided herein are for the analysis of at least part of a subject’s genome, e.g ., for the detection and identification of single-nucleotide polymorphisms (SNPs) in one or more driver genes disclosed herein.
  • the in vitro diagnostic kits provided herein are for the detection and/or analysis of the expression level (e.g, transcript or protein level) of one or more driver genes disclosed herein.
  • such in vitro diagnostic kits contemplated herein are for the detection of soluble ADAM9 protein.
  • the in vitro diagnostic kits provided herein are for the detection and/or analysis of the activity of the gene product of one or more driver genes disclosed herein, e.g., detection and analysis of the proteolytic activity of ADAM9 protein.
  • the diagnostic system of the present invention further includes a label or indicating means capable of signaling the formation of a complex containing a recombinant antigen.
  • label and “indicating means” in their various grammatical forms refer to single atoms and molecules that are either directly or indirectly involved in the production of a detectable signal to indicate the presence of a complex. Any label or indicating means can be linked to or incorporated in an expressed protein or polypeptide, or used separately, and those atoms or molecules can be used alone or in conjunction with additional reagents. Such labels are themselves well-known in clinical diagnostic chemistry and constitute a part of this invention only insofar as they are utilized with otherwise novel proteins methods and/or systems.
  • the diagnostic kits of the present invention can be used in an “ELISA” format to detect and quantify peptides, proteins, antibodies, and hormones of interest identified by the methods disclosed herein.
  • ELISA refers to an enzyme- linked immunosorbent assay that employs an antibody or antigen bound to a solid phase and an enzyme-antigen or enzyme-antibody conjugate to detect and quantify the amount of an antigen or antibody present in a sample.
  • a description of the ELISA technique is found in Chapter 22 of the 4th Edition of Basic and Clinical Immunology by D. P. Sites et al., published by Lange Medical Publications of Los Altos, Calif in 1982 and in U.S. Pat. Nos. 3,654,090; 3,850,752; and 4,016,043, which are all incorporated herein by reference.
  • control group including 22 healthy age and sex-matched blood donors under 50 years old were included as a “control group”.
  • Blood sampling was performed at ward/ICU admission and for ICU patients every four days until hospital discharge.
  • Venipunctures were performed at admission in ICU or medical ward within the framework or routine diagnostic procedures. A subset of ICU patients (73%) were sampled every 4-8 days post-hospitalization until discharge or death.
  • Patient blood was collected in a BD Vacutainer tube with Heparin (for plasma and PBMC), EDTA (for DNA) or without additive (for serum) and in PAXgene® Blood RNA tubes (Becton, Dickinson and Company, USA). Healthy donors were sampled in BD Vacutainer tubes with Heparin, with EDTA or without additive. Plasma and serum fractions were collected after centrifugation at 1200 x g at room temperature for 10 min, aliquoted, and stored at -80°C until use.
  • PBMCs Peripheral Blood Mononuclear Cells
  • FCS fetal calf serum
  • DMSO Dimethyl Sulfoxide
  • Plasma were analyzed with the V-PLEX Proinflammatory Panel 1 Human Kit (IL-6, IL-8, IL-10, TNF-a, IL-12p70, IL-Ib, GM-CSF, IL-2, and IFN-g) and the S-PLEX Human IFN-a2a Kit following the manufacturer’s instructions (Mesoscale Discovery, USA). Plasma were used undiluted for the S-PLEX Human IFN-a2a Kit and diluted 2 times for the V-PLEX Proinflammatory Panel 1. MSD plates were analyzed on the MS2400 imager (Mesoscale Discovery, Gaithersburg, MD). Soluble IL-17 was quantified by Quantikine® HS ELISA (Human IL-17 Immunoassay) on undiluted serum followings the manufacturer’s instructions (R&D Systems, Minneapolis, MN). All standards and samples were measured in duplicate.
  • V-PLEX Proinflammatory Panel 1 Human Kit IL-6, IL-8, IL-10, TNF-a, IL-12p70, IL-I
  • PBMC peripheral blood mononuclear cells
  • FCS files of each group Healthy, Critical, Non-Critical were then concatenated with CyTOF® software v.7.0.8493.0 for viSNE analysis (Cytobank Inc, USA). A total of 300,000 events were used for viSNE maps that was generated with the following parameters: iterations (1,000), perplexity (30) and theta (0.5). ViSNE maps are presented as means of all samples in each group.
  • Samples were prepared using the PreOmics iST Kit (PreOmics GmbH, Martinsried, Germany) according to the manufacturer’s protocol. Two m ⁇ of plasma were mixed with 50 m ⁇ Lyse buffer. Briefly, protein concentration was determined using the Bradford assay (Biorad, USA) according to the manufacturer’s instructions. Samples were transferred to 96 well-plate cartridges. Then, 50 m ⁇ of resuspended Digest solution were added and samples were heated at 37 °C for 2 h before adding 100 m ⁇ of Stop buffer. Samples were centrifuged in order to retain the peptides on the cartridge and washed twice with “Wash 1” and “Wash 2” buffers.
  • NanoLC-MS/MS analyses were performed on a nanoAcquity UltraPerformance LC® (UPLC®) device (Waters Corporation, USA) coupled to a Q-ExactiveTM Plus mass spectrometer (Thermo Fisher Scientific, USA). Peptide separation was performed on an ACQUITY UPLC BEH130 C18 column (250 mm x 75 pm with 1.7 pm diameter particles) and a Symmetry C18 precolumn (20 mm x 180 pm with 5 pm diameter particles, Waters). The solvent system consisted of 0.1% FA in water (solvent A) and 0.1% FA in ACN (solvent B).
  • Samples (equivalent to 500 ng of proteins) were loaded into the enrichment column over 3 min at 5 pL/min with 99% of solvent A and 1% of solvent B.
  • the peptides were eluted at 400 nL/min with the following gradient of solvent B: from 1 to 35 % over 60 min and 35 to 90 % over 1 min.
  • the 93 samples were injected in randomized order.
  • the MS capillary voltage was set to 2.1 kV at 250 °C.
  • the ten most abundant ions were selected on each MS spectrum for further isolation and higher energy collision dissociation fragmentation, excluding unassigned and monocharged ions.
  • the dynamic exclusion time was set to 60s.
  • a sample pool comprising equal amounts of all protein extracts was constituted and regularly injected during the course of the experiment, as an additional Quality Control.
  • the minimal peptide length required was seven amino acids and a maximum of one missed cleavage was allowed.
  • Methionine oxidation and acetylation of protein’s N- termini were set as variable modifications and acetylated and modified methionine-containing peptides, as well as their unmodified counterparts, were excluded from protein quantification. Cysteine carbamidomethylation was set as a fixed modification. For protein quantification, the “match between runs” option was enabled. The maximum false discovery rate was set to 1% at peptide and protein levels with the use of a decoy strategy.
  • LFQ intensities were extracted from the ProteinGroups.txt file after removal of non-human and keratin contaminants, as well as reverse and proteins only identified by site. Complete datasets have been deposited in the ProteomeXchange Consortium database with the identifier PXD 025265 (Alhazzani et al., 2020).
  • LFQ Normalized label -free quantification
  • Samples were prepared using the PreOmics’ iST Kit (PreOmics GmbH, Martinsried, Germany) according to the manufacturer’s protocol. Briefly, PBMC pellets were resuspended in 50 m ⁇ Lyse buffer and heated at 95 °C for 10 min at 1,000 rpm before being sonicated for 10 min on ice. Protein concentration of the extract was determined using the Bradford assay (Biorad, Hercules, USA) according to the manufacturer’s instructions. Samples were transferred to 96 well-plate cartridges. Then, 50 m ⁇ of resuspended Digest solution were added and samples were heated at 37 °C for 2 h before adding 100 m ⁇ of Stop buffer.
  • NanoLC-MS/MS analyses were performed on a nanoAcquity UPLC device (Waters Corporation, USA) coupled to a Q-Exactive HF-X mass spectrometer (Thermo Fisher Scientific, USA). Peptide separation was performed on an Acquity UPLC BEH130 C18 column (250 mm x 75 pm with 1.7 pm diameter particles) and a Symmetry C18 precolumn (20 mm x 180 pm with 5 pm diameter particles, Waters).
  • the solvent system consisted of 0.1% Formic Acid (FA) in water (solvent A) and 0.1% FA in Acetonitrile (ACN) (solvent B).
  • Samples (equivalent to 414 ng of proteins) were loaded into the enrichment column over 3 min at 5 pL/min with 99 % of solvent A and 1 % of solvent B.
  • the peptides were eluted at 400 nL/min with the following gradient of solvent B: from 2 to 25 % over 53 min, 25 to 40 % over 10 min and 40 to 90 % over 2 min.
  • the 77 samples were injected using a randomized injection sequence.
  • the MS capillary voltage was set to 1.9 kV at 250 °C.
  • AGC Automatic gain control
  • AGC fixed at 1 x 10 5
  • the dynamic exclusion time was set to 60 s.
  • a sample pool comprising equal amounts of all protein extracts was constituted and regularly injected during the course of the experiment, as an additional Quality Control.
  • Raw data obtained for each sample 34 Critical Patients, 21 Non-Critical patients and 22 healthy controls
  • MaxQuant software version 1.6.14
  • Peaks were assigned with the Andromeda search engine with trypsin/P specificity.
  • a combined human and bovine database (because of potential traces of fetal calf serum in samples) was extracted from UniProtKB-SwissProt (as of September 8, 2020, 26,413 entries). The minimal peptide length required was seven amino acids and a maximum of one missed cleavage was allowed.
  • Methionine oxidation and acetylation of protein’s N-termini were set as variable modifications and acetylated and modified methionine-containing peptides, as well as their unmodified counterparts, were excluded from protein quantification. Cysteine carbamidom ethylation was set as a fixed modification. For protein quantification, the “match between runs” option was enabled. The maximum false discovery rate was set to 1% at peptide and protein levels with the use of a decoy strategy. Only peptides unique to human entries were kept and their intensities were summed to derive protein intensities. Complete datasets have been deposited in the ProteomeXchange Consortium database with the identifier PXD 025265 (Deutsch et al., 2017).
  • LFQ Normalized label -free quantification
  • WGS data was generated from DNA isolated from whole blood. Illumina Novaseq- 6000 machines were used for DNA sequencing to a mean 3 OX coverage. Raw sequencing reads from FASTQ files were aligned using Burrows-Wheeler Aligner (BWA) (Li and Durbin, 2009) and GVCF files were generated using Sentieon version 201808.03 (Kendig et al., 2019). Functional annotation of variants was done using Variant Effect Predictor from Ensembl (version 101). GATK version 4 (Van der Auwera et al., 2013; DePristo et al., 2011) was used for joint genotyping process and variant quality score recalibration (VQSR).
  • VQSR Variant Effect Predictor from Ensembl
  • RNA seguencins (RNA-seq)
  • RNA sequencing libraries were generated using TruSeq Stranded Total RNA with Ribo-Zero Globin kit (Illumina, USA) and sequenced on the Illumina NovaSeq 6000 instrument with S2 flow cells and 15 lbp paired-end reads.
  • Raw sequencing data was aligned to a reference human genome build 38 (GRCh38) using short reads aligner STAR (Dobin et ah, 2013). Quantification of gene expression was performed using RSEM (Li and Dewey, 2011) with GENCODE annotation v25 (http://www.gencodegenes.org). Raw and processed datasets have been deposited in GEO with identifier GSE172114.
  • DGE Differential gene expression
  • DGE analysis was performed for each cut of the train data using a frozen normalization approach to normalize library sizes using the trimmed mean of M-values method (TMM) from the edgeR R package (Robinson and Oshlack, 2010; Robinson et ak, 2010). Briefly, low expressed genes were removed for the 69 samples with genes with 1 count per million in less than 10% of samples. For each cut of the train data, the normalization factors were calculated, then the library that had a normalization factor closest to 1 was selected. This was used as a reference library to normalize all samples keeping the training normalization factors unchanged. Differentially expressed genes were identified using a quasi-likelihood F-test (QLF) adjusted P values from edgeR R package. Differentially expressed genes with false discovery rate (FDR) less than 0.05 were used for further downstream analysis.
  • QLF quasi-likelihood F-test
  • classification as a feature selection approach was used, and then the most informative features were used as input to structural causal modeling to identify potential driver genes. More specifically, classification was performed on the RNA- seq data by repeatedly splitting Non-critical and Critical into 100 unique training and independent test sets representing 80% and 20% of total data, respectively, ensuring that the proportions of Non-critical and Critical patients was consistent in each split of the data. 100 splits of the data were used in order to capture biological variation and have more statistical confidence in the results. After classification, feature scores for each method were determined and combined across all 100 splits of the data and 6 of the machine learning algorithms, not including the deep learning. The top 600 most informative features were retained for structural causal modeling.
  • the output of the structural causal modeling returned a putative directed network depicting the flow of causal information.
  • differential expression for the plasma and PBMC proteomics data was also performed, SKAT for the WGS data, and eQTL and pQTL analysis for the genomic and proteomics data, respectively.
  • Hyper parameters were chosen by using 10-fold cross-validation on the training data, with performance evaluated on the held-out test data.
  • LASSO (Tibshirani, 1996) is an Ll-penalized linear regression model defined as: Ridge (Hoerl and Kennard, 1970; Hoerl et al., 1975) is an L2-penalized linear regression model defined as: where
  • l > 0 is the regularization parameter that controls model complexity
  • b are the regression coefficients
  • b 0 is the intercept term
  • y are the class labels
  • x t is the ith training sample
  • the goal of the training procedure is to determine b, the optimal regression coefficients that minimize the quantities defined in Eqs. (1) and (2).
  • the constraint placed on the norm of b (the strength of which is given by A) causes coefficients of uninformative features to shrink to zero. This leads to a simpler model that contains only a few non-zero coefficients.
  • the ‘glmnef function from the caret (Kuhn, 2008) R package was used to train all LASSO and Ridge models. Ridge plays a similar role in determining model complexity, except that coefficients for uninformative features do not necessarily shrink to zero.
  • SVM Support Vector Machines
  • Support vector machines (SVMs) (Boser et al., 1992; Cortes and Vapnik, 1995) are a set of supervised learning models used for classification and regression analysis.
  • the primal form of the optimization problem is: where L p is the loss function in its primal form (p for primal), w are the weights to be determined in the optimization, x t is the ith training sample, y is the label of the ith training sample, a t > 0 are Lagrange multipliers, N is the number of training points, and b is the intercept term. Labels are predicted by thresholding x t w + b.
  • L D is the Lagrangian dual of the primal problem
  • a L are the Lagrange multipliers
  • y L and x t are the ith label and training sample, respectively
  • C is a hyper-parameter that controls the degree of misclassification of the model for nonlinear classifiers.
  • the optimal value of w and b can found in terms of the s, and the label of a new data point x can be found by thresholding the output a L y t K (x L , x) + b.
  • C ranged from 2 L (-2) to 2 L 3, and a 10-fold cross-validation was used to tune and select the hyperparameters with the best cross- validation accuracy for training the model.
  • Random Forest (Breiman, 2001; Breiman et ah, 1993) is an ensemble learning method for classification and regression which builds a set (or forest) of decision trees.
  • n samples are chosen (typically two-thirds of all the training data) with replacement from the training data m times, giving m different decision trees.
  • Each tree is grown by considering ‘mtry’ of the total features, and the tree is split depending on which features gives the smallest Gini impurity.
  • the predicted label is given by the mode of all the training samples in a terminal node.
  • the final prediction for a new sample x is determined by taking the majority vote over all the trees in the forest.
  • the ‘rf function was used from the caret (Kuhn, 2008) R package to train all Random Forest models.
  • a 10-fold cross-validation was used to tune parameters for training the model.
  • XGBoost (XGB) XGBoost (Chen and Guestrin, 2016) is a distributed gradient boosting library for classification and regression by building an ensemble of decision trees.
  • XGBoost uses an additive strategy to add new trees one at a time based on whether they optimize the objective function.
  • the objective function for the t-th tree is: where G j - y ), H j 2 ⁇ l j ⁇ , X and g are hyper-parameters controlling model complexity, T is the number of leaves in the trees, w ; ⁇ is the combined score across all the data points for the j- th leaf.
  • I j refers to the set of indices of data points assigned to the j- th leaf
  • l j I is the size of the set I j
  • y t - is the actual label of the t-th data point.
  • the default parameter tuning grid in R was used, and a 10-fold cross-validation was used to tune and select the hyperparameters with the best cross-validation accuracy for training the model.
  • Quantum support vector machine is a quantum adaptation of SVM that can be used for classification designed to be run with a quantum annealer (QA) (Willsch et al., 2020).
  • QA quantum annealer
  • the advantage of running the optimization problem on a QA is that, since the QA samples from the quantum distribution, it retains both the lowest energy solution and some of the next lowest-energy solutions.
  • qSVM is expected to perform worse on the train data than classical SVM (which only includes optimal solution).
  • sub-optimal solutions can capture different aspects of train data, and generate different decision boundaries. As such, a suitable combination of the suboptimal solutions in qSVM might outperform cSVM on the test data.
  • the optimization problem gets the form of a Quadratic Unconstrained Binary Optimization (QUBO) problem, which can be run on a QA:
  • QUBO Quadratic Unconstrained Binary Optimization
  • DANN Deep learning methodologies were adapted to analyze genomic datasets (Alipanahi et al., 2015)
  • Typical deep neural networks use a series of nonlinear transformations (termed layers), with the final output considered a prediction of class or regression variable.
  • Each layer consists of a set of weights (W) and biases (b) that are tuned during a training phase to learn which nonlinear combinations of input features are most important for the prediction task.
  • W weights
  • b biases
  • These types of models “automatically” learn patterns in the data and combine them, in some abstract nonlinear fashion, to gain an ability to make predictions about the dataset.
  • Pi(z) max(o,z).
  • the final layer used a softmax function, with the number of neurons equal to the number of class ( K ), to convert the logits to probabilities: where f mj is the output of the y-th neuron of the m-th layer.
  • the concept of “dropout” was used, which randomly sets a portion of input values (h) to the layer to zero during the training phase (Srivastava et ah, 2014). This has a strong regularization effect (essentially by injecting random noise) that helps prevent models from overfitting. Layers that included dropout were formulated as where m L ⁇ Bernoulli ⁇ ).
  • LASSO, Ridge, SVM, and qSVM are linear models, and thus the feature importance was determined based on the value of the weight assigned to each feature, with a larger score corresponding to greater importance.
  • Random Forest creates a forest of decision trees, and as part of the fitting process determines an estimate of the feature importance by randomly permuting the features one at a time and determining the change in the accuracy.
  • XGBoost calculates feature importance by averaging the gain across all the trees, where the gain is the difference in the Gini purity of the parent node and the two children nodes.
  • the top 1000 most informative features for each model, for each cut of the data were retained for each of the 100 cuts of the training data. Because there were 100 cuts of the data, 6 algorithms (LASSO, Ridge, SVM, qSVM, RF, and XGBoost; DANN was not included because it lacks a robust approach to determine feature importance), and up to 1000 features retained, a total of up to 600,000 possible features were considered for each feature set (though they may not be unique, as the top 1000 features for one cut of the data may have some overlap with the top 1000 features for another cut of the data). Feature scores from an algorithm on any cut that had a test AUROC ⁇ 0.7 were discarded, in an attempt to exclude scores that may not truly be informative.
  • the scores were scaled by the most informative feature for each algorithm on each cut, such that the feature scores all lay between 0 and 1, /. e. , for the first cut of the data the 1000 most informative features from LASSO were scaled, then the same was done for Ridge, SVM, Random Forest, and the process repeated for each cut of the data. Scores were then averaged across all the cuts of the data to give a feature ranking for each method. If a feature was determined to be important for one cut of the data but not for others, it was given a value of 0 for all cuts of the data in which it did not appear. To determine a final ensemble feature ranking, the grand mean across all training cuts and algorithms was taken, and the features were sorted by the average score.
  • BBNs were generated for the top 600 most informative genes as defined by ensemble feature ranking described above. BBNs were used to assess the conditional dependence and probabilistic relationships between the most informative genes. Briefly, a minibatch stochastic gradient descent with Nesterov momentum was used to update the DANN parameters based on the loss function above (Sutskever et al., 2013). The TensorFlow (Abadi et al., 2016) python package was used to construct the DANNs. G.
  • causal sufficiency assumption where there are no unobserved cofounders
  • causal Markov assumption where all d-separations in the graph (G) imply conditional independence in the observed probability distribution
  • causal faithfulness assumption where all of the conditional independences in the observed probability distribution imply d-separations in the graph ( G ).
  • the data may not strictly meet all of these assumptions, however the generated BBNs provide useful biological hypothesis that could be experimentally validated.
  • BBNs were determined using the bnleam R package with the score-based hill climbing algorithm that heuristically searched the optimality space of all possible DAGs (Scutari, 2010). As the hill-climbing algorithm can get trapped in local optima and is quite dependent on the starting structure, 100 BBNs starting from different network seeds were initialized. During the hill-climbing process, each candidate BBN was assessed with the Bayesian information criterion (BIC) score (Lam and Bacchus, 1994; Scutari, 2010): d
  • BIC log L (A 1; ... ,X v ) - ⁇ log n, where X ... ,X V is the node set, d is the number of free parameters, n is the sample size of the dataset, and L is the likelihood.
  • This definition of the BIC which is the version implemented in the bnleam package, rescales the classic definition by -2. The penalty term was used to prevent overly complicated structures and overfitting. Each ran of the hill climbing algorithm returns a structure that maximizes the BIC score (including evaluating the directions of edges). A caveat is that these structures may be partially oriented graphs (i.e., situations where the directionality of some edges cannot be effectively determined).
  • the cextend function from the bnleam package was used to construct a DAG that is a consistent extension of X.
  • a consensus network based on the 100 networks after hill-climbing was then generated, wherein edges that were present in graphs at least 30% of the time were kept. Any residual undirected edges contained in the consensus network were discarded.
  • Statistical significance of edges within the imposed consensus network was assessed by randomly permuting the dataset 10,000 times and evaluating the consensus structure on these scrambled datasets (thus providing an estimate of the null distribution). BBN edges with a false discovery rate of 5% (i.e., the edge occurred in >500 of the random BBNs) or greater were removed from the final network.
  • CTTCGAAGTAGCTGAGTCATGCTGG-3 CTTCGAAGTAGCTGAGTCATGCTGG-3’ and GAPDH as a housekeeping gene: forward
  • RT-qPCR protocol consisted of: 95°C for 2 min, followed by 40 cycles: 95°C for 5 sec and 60°C for 30 sec. All reactions were performed in duplicate and the relative amounts of transcripts were calculated with the comparative Ct method. Gene expression changes were calculated using 2 DDa values calculated from averages of technical duplicates, relative to the negative control. Melting-curve analysis was performed to assess the specificity of the PCR products.
  • Enzyme-Linked Immunosorbent Assays ELISA
  • Soluble ADAM9 (sADAM9) and soluble MICA (sMICA) were quantified by ELISA on serum of Critical patients, Non-Critical patients and healthy controls.
  • soluble ADAM9 Human sADAM9 DuoSet ELISA kit (R&D Systems, Minneapolis, MN, USA) was used following manufacturer’s instructions.
  • sMICA levels were measured with an in-house developed sandwich enzyme-linked immunosorbent assay (ELISA) using two monoclonal mouse antibodies for capture (A13-C485B10 and A9-C255A9 at 2 mg/ml and 0.2 mg/ml, respectively) and one biotinylated monoclonal mouse antibody for detection (A15-C199B9 at 60 pg/ml).
  • Vero E6 cell lines were grown at 37 °C under 5% CO2 and maintained in DMEM Medium (ThermoFisher Scientific, USA) containing 100 units/ml penicillin, which was supplemented with 10% fetal bovine serum (Pan Biotech, Germany).
  • ACE2-expressing A549 cells (A549-ACE2) were grown at 37 °C under 5% CO2 and maintained in DMEM Medium (ThermoFisher Scientific, USA) containing 10 pg/ml of Blasticidine S (Invitrogen, USA).
  • Cells were transfected with predesigned Stealth siRNA directed against ADAM9 (HSS112867) or the control Stealth RNAi Negative Control Duplex medium GC (45-55%) (ThermoFisher Scientific, USA) by using LipofectamineTM 3000 Reagent (ThermoFisher Scientific, USA).
  • LipofectamineTM 3000 Reagent One day prior to transfection, the cells were seeded in a 24-well plate at 0.05 x 10 6 cells per well.
  • First 1.5 pi of LipofectamineTM 3000 Reagent were added to 25pl of Opti-MEMTM medium, followed by addition of the mix containing 5 pmoles of siRNA in 25 pi of Opti-MEMTM medium (ThermoFisher Scientific, EISA). The mixture was incubated at room temperature for 10 min and then added to the cells. The cells were collected or infected after 48h.
  • Bound antibodies were revealed with an enhanced chemiluminescence detection system using the ChemiDoc XRS (Bio-Rad Laboratories, USA). Loading control was performed with an anti-GAPDH antibody (MAB374, Merck Millipore, USA).
  • Vero E6 and A549-ACE2 cell lines were infected with SARS-CoV-2 wild type virus at MOI of 10 and 400, respectively. Percentage of infected cells was determined by staining with SARS-CoV-2 Nucleocapsid (% of Nucleocapsid positive cells) and virus released in the supernatant was analyzed by RT-PCR (copies/ml) after 2 and 3 days of infection for Vero E6 and A549-ACE2 cells, respectively.
  • RT-qPCR was performed using TaqPathTM 1-Step RT-qPCR Master Mix, CG on the Quanstudio3 instrument (ThermoFisher Scientific, USA).
  • the primer/probe mix used for absolute quantification of the virus are N1 and N2 from the 2019-nCoV RUO Kit (Integrated DNA Technologies, USA), and the positive control for the standard curve was 2019-nCoV N Positive Control (Integrated DNA Technologies, USA).
  • the reaction was performed in 20 m ⁇ , including 5 pi of eluted RNA, 5 m ⁇ of TaqPath master mix and 1.5 m ⁇ of primer/probe.
  • the RT-qPCR protocol consisted of: 25°C for 2 min, 50°C for 15 min, 95°C for 2 min, followed by 40 cycles: 95°C for 3 sec and 60°C for 30 sec. All reactions were performed in duplicate and the absolute quantification was calculated with the standard curve of the positive control.
  • BMI body mass index
  • IL-6 interleukin 6
  • IQR interquartile range.
  • ARDS acute respiratory distress syndrome
  • ECMO extracorporeal membrane oxygenation
  • IQR interquartile range
  • NMBA neuromuscular blocking agent
  • RRT renal replacement therapy
  • SAPSII simplified acute physiology score II
  • SOFA Sequential Organ Failure Assessment.
  • PBMC Peripheral Blood Mononuclear Cells
  • CDT ® mass-cytometry
  • RNA-seq and WGS was performed on whole blood. Unless otherwise specified, all measures were made on samples that were taken at the time of entry into the ICU or the non-critical care ward. Validation of the identified driver genes and pathways was performed using an ex vivo model of SARS- CoV-2 infection and a validation cohort of 81 critical patients and 73 recovered critical patients.
  • Example 3 Cytokines, antibodies, and immune cell hallmarks of critical COVID-19
  • the global pro-inflammatory cytokine profile showed a significantly increased concentration of IFNy, TNFa, IL-Ib, IL-4, IL-6, IL-8, IL-10 and IL-12p70 in critical versus non-critical patients ( Figure 2A).
  • This “cytokine storm” (Mehta et ak, 2020) is more pronounced in critical cases, as only IFNy, TNFa and IL-10 are higher in non-critical patients as compared to healthy controls.
  • Example 4 Quantitative plasma and PBMC proteomics highlight signatures of acute inflammation, myeloid activation and blood coagulation
  • Example 5 Combined transcriptomics and proteomics analysis supports inflammatory pathways associated with critical disease.
  • nGOseq nGOseq Nature 2017 May 11;545(7653):224-2278
  • Functional enrichment was performed on differentially expressed genes or proteins in RNA-seq, plasma and PBMC proteomics data.
  • Figure 6C shows the nGOseq terms that were statistically enriched in at least two omics datasets in critical vs. non-critical patients.
  • cytokine profiling Figure 2A
  • IL- 1, IL-8 and IL-12 pro-inflammatory cytokine release
  • nGOseq enrichment also indicated that the dysfunction in blood coagulation involves a fibrinolytic response, an observation that could, however, be linked to the anti-coagulant therapy of most critical patients (91% of critical patients vs. 56% of non-critical patients were treated with heparin).
  • nGOseq terms related to viral entry and even viral transcription were strongly enriched in the three omics datasets. This result was concordant with the identification of viral gene transcripts in RNA-seq data of 8 critical patients but not in non- critical patients (Table 3).
  • ROCs Receiver Operating Characteristic curves
  • Figure 7A and Table 4 The Receiver Operating Characteristic curves (ROCs) for the 100 partitions of patient data as well as other classification performance metrics are shown in Figure 7A and Table 4.
  • the classification performance on the test set provided a high degree of confidence that the signals learned by the various AI/ML algorithms are generalizable.
  • Table 4 Performance metrics on the train and test set for each algorithm in the ensemble computational intelligence approach.
  • SCM of RNA-seq data produces causal dependency structures, indicative of the signal transduction cascades that occur within cells and drive phenotypic and pathophenotypic development (Ricard et ah, J Exp Med, 2019).
  • DAG directed acyclic graph
  • Figure 7B a gene network representing the putative flow of causal information, with genes on the left predicted to have the greatest degree of influence on the entire state of the network. Perturbing these genes is most disruptive to the state of the network ( Figure 8), and is expected to have the greatest effect on the expression of downstream genes.
  • the top five genes that associated with the greatest degree of putative causal dependency ar eADAM9, RAB10 , MCEMP1 , MS4A4A and GCLM , all five being significantly up-regulated in critical patients (Figure 7C).
  • the DAG also shows 5 downstream genes at the right of the graph in Figure 7B ( EPHX2 , RORA, CFAP97, ARL4C or ACSS1) which are predicted to have the greatest change in expression due to change in the 5 driver genes described above.
  • downstream genes may be useful to monitor the effects of therapy of COVID-19 ARDS by methods known in the art (e.g ., qPCR, qRT-PCR, digital PCR, ELISA, and the like)using one or more driver genes as drug targets.
  • qPCR qRT-PCR
  • digital PCR digital PCR
  • ELISA ELISA
  • the usefulness of the 600 genes identified in this first group of patients was then evaluated in a second patient cohort, consisting of critical COVID-19 patients sampled at ICU entry and recovered critical patients sampled at three months after ICU exit.
  • the top 600 genes from the first patient cohort were able to significantly differentiate between critical and recovered patients ( Figures 9A, 9B, and Table 5); classification performance when training on the differentially expressed genes between critical and recovered patients is nearly the same (not shown), indicating the high degree of generalizability of this gene signature.
  • the five identified driver genes in patient cohort 1 were also shown to be up- regulated in critical patients in this second patient cohort (Figure 9C).
  • gene signature i.e., the genes set forth in Table 5, may be used in place of, or in addition to, genes ADAM9, MCEMP1, MS4A4A, RAB10, GCLM, EPHX2, RORA, CFAP97, ARL4C or ACSS1 in the the methods disclosed herein.
  • the methods disclosed herein may comprise one or more of the steps of (a) identifying from the sequencing of said sample at least one single-nucleotide polymorphism (SNP) in one or more of genes set forth in Table 5; (b) measuring the level of soluble protein expressed by one or more of the genes set forth in Table 5 in a sample from the subject; (c) measuring the expression level of one or more of the genes set forth in Table 5 at the RNA level in a sample from the subject; and/or (d) measuring the expression level of one or more of the genes set forth in Table 5 at the protein level in a sample from the subject.
  • SNP single-nucleotide polymorphism
  • Example 7 ADAM9 is a major driver of ARDS in critical COVID-19 patients
  • ADAM9 A disintegrin and a metalloprotease
  • COVID-19 etiology focus was on experimentally determining the role of ADAM9 (A disintegrin and a metalloprotease) in COVID-19 etiology as (i) it is the gene with the greatest degree of causal influence in the SCM DAG, (ii) it is the only driver gene that has previously been shown to interact with SARS-CoV-2 by a global interactomics approach (Gordon et al., 2020a, 2020b) and (iii) it is an entry factor for another RNA virus, the Encephalomyocarditis Virus (Bazzone et al.,
  • ADAM9 is a metalloprotease with various functions that are either mediated by its disintegrin domain for adhesion or by its metalloprotease domain for the shedding of a large range of cell surface proteins (Chou et al., 2020).
  • the ADAM9 gene encodes two isoforms encoding respectively for a membrane bound and a secreted protein. Although neither isoform could be detected by the proteomics approach, ADAM9 was up-regulated at the RNA level and the secreted form showed a higher concentration in the plasma of critical versus non-critical patients ( Figures 10A and 10B).
  • the transcriptional up-regulation oiADAM9 was also associated with disease severity in a previously published bulk RNA-seq dataset ( Figure 11) (Arunachalam et al., 2020).
  • ELISA was used to quantify the soluble form of the MICA protein, which is known to be cleaved by ADAM9 (Kohga et al., 2010).
  • the concentration of soluble MICA was indeed significantly higher in the plasma of critical patients as compared to non-critical patients and healthy controls (Figure IOC).
  • Global eQTL analysis using whole genome sequencing and RNA-seq data showed 8 SNPs associated with three of the top five putative driver genes with genome-wide significance (Table 6).
  • rs7840270 is localized just 0.3kb upstream of the ADAM9 gene and an eQTL for blood expression reported in GTEX ( Figure 10D).
  • ADAM9 was silenced by siRNA in Vero-E6 or A549-ACE2 (Buchrieser et al., 2020) cells and subsequently infected the cells with SARS-CoV-2. Viral entry was monitored by flow cytometry quantification of the internalized nucleocapsid protein and the viral replication by quantitative viral RT-PCR in the culture supernatant (Figure 10E). The average silencing efficiency reached 66% in vero-E6 cells and 93% in A549-ACE2 cells ( Figure 12).
  • a multi-omics strategy associated with integrated AI/ML and probabilistic programming methods was used to identify pathways and signatures that can differentiate critical from non-critical patients in a population of patients below 50 years of age and without major comorbidities.
  • This in silico strategy provided a detailed view of the systemic immune response that was globally in line with previously published data.
  • a consistent transcriptomic signature that was able to robustly differentiate critical from non-critical patients, as shown by the classification performance metrics assessed was also defined (Figure 7A and Table 4). Notably, this signature can be generalized as the classification performance was shown to perform equally well in a replication cohort composed of 81 critically ill patients and 73 recovered critical patients (Figure 9).
  • RAB10 Ras-related protein Rab-10
  • MCEMP1 MCEMP1
  • MS4A4A GCLM
  • ADAM9 ADAM9
  • MCEMP1 (Mast Cell Expressed Membrane Protein 1) is a membrane protein specifically associated with lung mast cells and for which a lowered expression has been shown to reduce inflammation of septic mice (Li et al., 2005; Xie et al., 2020).
  • MS4A4A (a member of the membrane-spanning, four domain family, subfamily A) is a surface marker for M2 macrophages which mediate immune responses in pathogen clearance (Sanyal et al., 2017) and regulates arginase 1 induction during macrophage polarization and lung inflammation in mice (Sui and Zeng, 2020).
  • GCLM Glutamate-Cysteine Ligase Modifier Subunit
  • ADAM9 Disintegrin and metalloproteinase domain-containing protein 9
  • ADAM9 is the subject of cancer research, e.g ., as a target for antibody-drug-conjugate therapy of solid tumors (Sui and Zeng, 2020)
  • the data provided herein suggests a repurposing strategy using ADAM9 blocking antibodies or other therapeutic agents to reduce ADAM9 levels or activity to treat critical COVID-19 patients.
  • a feature vector is provided to a trained classifier.
  • the learning system is pre-trained using training data.
  • training data is retrospective data.
  • the retrospective data is stored in a data store.
  • the learning system may be additionally trained through manual curation of previously generated outputs. It will be appreciated that in addition to the specific examples provided above, a variety of other classifiers are suitable for use according to the present disclosure, including random decision forests, linear classifiers, support vector machines (SVM), and neural networks such as recurrent neural networks (RNN).
  • SVM support vector machines
  • RNN recurrent neural networks
  • Suitable artificial neural networks include but are not limited to a feedforward neural network, a radial basis function network, a self-organizing map, learning vector quantization, a recurrent neural network, a Hopfield network, a Boltzmann machine, an echo state network, long short term memory, a bi-directional recurrent neural network, a hierarchical recurrent neural network, a stochastic neural network, a modular neural network, an associative neural network, a deep neural network, a deep belief network, a convolutional neural networks, a convolutional deep belief network, a large memory storage and retrieval neural network, a deep Boltzmann machine, a deep stacking network, a tensor deep stacking network, a spike and slab restricted Boltzmann machine, a compound hierarchical-deep model, a deep coding network, a multilayer kernel machine, or a deep Q-network.
  • the present disclosure may be embodied as a system, a method, and/or a computer program product.
  • the computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present disclosure.
  • the computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device.
  • the computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
  • a non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD- ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing.
  • RAM random access memory
  • ROM read-only memory
  • EPROM or Flash memory erasable programmable read-only memory
  • SRAM static random access memory
  • CD- ROM compact disc read-only memory
  • DVD digital versatile disk
  • memory stick a floppy disk
  • a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon
  • a computer readable storage medium is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g ., light pulses passing through a fiber optic cable), or electrical signals transmitted through a wire.
  • Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network.
  • the network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers.
  • a network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
  • Computer readable program instructions for carrying out operations of the present disclosure may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages.
  • the computer readable program instructions may execute entirely on the user’s computer, partly on the user’s computer, as a stand-alone software package, partly on the user’s computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer may be connected to the user’s computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
  • electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present disclosure. Aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.
  • These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
  • These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
  • the computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s).
  • the functions noted in the block may occur out of the order noted in the figures.
  • two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Immunology (AREA)
  • Medical Informatics (AREA)
  • Wood Science & Technology (AREA)
  • Biotechnology (AREA)
  • Genetics & Genomics (AREA)
  • Public Health (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Virology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Pathology (AREA)
  • Data Mining & Analysis (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Epidemiology (AREA)
  • Medicinal Chemistry (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Evolutionary Biology (AREA)
  • Hematology (AREA)
  • Primary Health Care (AREA)
  • Urology & Nephrology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Communicable Diseases (AREA)
  • Artificial Intelligence (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Tropical Medicine & Parasitology (AREA)

Abstract

La présente invention concerne un procédé de traitement ou de prévention d'une maladie sévère de coronavirus 2019 (COVID-19) chez un sujet, comprenant l'administration au sujet d'une composition comprenant un agent de modulation qui diminue ou augmente l'expression ou l'activité de produit génique d'un ou de plusieurs gènes moteurs.
PCT/US2022/028326 2021-05-10 2022-05-09 Procédés pour l'identification et le traitement de formes sévères de covid-19 WO2022240743A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202163186604P 2021-05-10 2021-05-10
US63/186,604 2021-05-10

Publications (1)

Publication Number Publication Date
WO2022240743A1 true WO2022240743A1 (fr) 2022-11-17

Family

ID=84029398

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2022/028326 WO2022240743A1 (fr) 2021-05-10 2022-05-09 Procédés pour l'identification et le traitement de formes sévères de covid-19

Country Status (1)

Country Link
WO (1) WO2022240743A1 (fr)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170240968A1 (en) * 2014-08-13 2017-08-24 Brigham Young University Allelic polymorphisms associated with reduced risk for alzheimer's disease
US20200158716A1 (en) * 2017-07-17 2020-05-21 Massachusetts Institute Of Technology Cell atlas of healthy and diseased barrier tissues
WO2022066963A1 (fr) * 2020-09-25 2022-03-31 The Board Of Trustees Of The Leland Stanford Junior University Procédé pour déterminer le risque d'un sujet infecté par un virus de développer des symptômes graves

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170240968A1 (en) * 2014-08-13 2017-08-24 Brigham Young University Allelic polymorphisms associated with reduced risk for alzheimer's disease
US20200158716A1 (en) * 2017-07-17 2020-05-21 Massachusetts Institute Of Technology Cell atlas of healthy and diseased barrier tissues
WO2022066963A1 (fr) * 2020-09-25 2022-03-31 The Board Of Trustees Of The Leland Stanford Junior University Procédé pour déterminer le risque d'un sujet infecté par un virus de développer des symptômes graves

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
DITE GILLIAN S., MURPHY NICHOLAS M, ALLMAN RICHARD: "Development and validation of a clinical and genetic model for predicting risk of severe COVID-19", CAMBRIDGE UNIVERSITY PRESS, 12 March 2021 (2021-03-12), XP055879720, [retrieved on 20220117] *
PINE ALEXANDER B., MEIZLISH MATTHEW L., GOSHUA GEORGE, CHANG C‐HONG, ZHANG HANMING, BISHAI JASON, BAHEL PARVEEN, PATEL AMISHA, GBY: "Circulating markers of angiogenesis and endotheliopathy in COVID‐19", PULMONARY CIRCULATION 2012 APR-JUN, vol. 10, no. 4, 1 October 2020 (2020-10-01), pages 1 - 4, XP093015138, ISSN: 2045-8940, DOI: 10.1177/2045894020966547 *
SUN CHAOYANG, BAI YONG, CHEN DONGSHENG, HE LIANG, ZHU JIACHENG, DING XIANGNING, LUO LIHUA, REN YAN, XING HUI, JIN XIN, CHEN GANG: "Accurate classification of COVID‐19 patients with different severity via machine learning", CLINICAL AND TRANSLATIONAL MEDICINE, INTERNATIONAL SOCIETY FOR TRANSLATIONAL MEDICINE, SE, vol. 11, no. 3, 1 March 2021 (2021-03-01), SE , XP093015139, ISSN: 2001-1326, DOI: 10.1002/ctm2.323 *

Similar Documents

Publication Publication Date Title
US20090258002A1 (en) Biomarkers for Tissue Status
Konigsberg et al. Molecular signatures of idiopathic pulmonary fibrosis
JP2008538007A (ja) 敗血症の診断
Dai et al. Circulating long noncoding RNAs as potential biomarkers of sepsis: a preliminary study
Sleiman et al. The gene-regulatory footprint of aging highlights conserved central regulators
US20180298455A1 (en) Risk stratification in influenza
RU2663724C2 (ru) Способы определения восприимчивости пациента к внутрибольничной инфекции и составления прогноза развития септического синдрома
Jiang et al. RNA sequencing data from neutrophils of patients with cystic fibrosis reveals potential for developing biomarkers for pulmonary exacerbations
US20180251844A1 (en) Host dna as a biomarker of crohn's disease
Yan et al. Longitudinal peripheral blood transcriptional analysis of COVID-19 patients captures disease progression and reveals potential biomarkers
CA2884737A1 (fr) Marqueurs diagnostiques du cancer de la prostate indolent
Gisby et al. Multi-omics identify falling LRRC15 as a COVID-19 severity marker and persistent pro-thrombotic signals in convalescence
WO2022125702A1 (fr) Analyse de l'expression d'un gène hôte pour le diagnostic d'une infection par le coronavirus à syndrome respiratoire aigu sévère 2
Martínez-Diz et al. Relevance of TMPRSS2, CD163/CD206, and CD33 in clinical severity stratification of COVID-19
Wang et al. Identifying hub genes of calcific aortic valve disease and revealing the immune infiltration landscape based on multiple WGCNA and single-cell sequence analysis
WO2022240743A1 (fr) Procédés pour l'identification et le traitement de formes sévères de covid-19
WO2022240746A1 (fr) Méthodes pour l'identification et le traitement de formes sévères de covid-19
US10078086B2 (en) Use of interleukin-27 as a diagnostic biomarker for bacterial infection in critically ill patients
CN114959001A (zh) 用于诊断急性心肌梗死的产品及其应用
Kufova et al. Biomarkers in immunoglobulin light chain amyloidosis
Carapito et al. Identification of driver genes for severe forms of COVID-19 in a deeply phenotyped young patient cohort
Bernardes et al. Longitudinal multi-omics analysis identifies responses of megakaryocytes, erythroid cells and plasmablasts as hallmarks of severe COVID-19 trajectories
KR20230105692A (ko) 말초 적혈구로부터 소핵 dna 및 이의 용도
Giannini et al. Genetics of acute respiratory distress syndrome: pathways to precision
Chen et al. Whole-genome microRNA sequencing analysis in patients with pulmonary hypertension

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22808122

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22808122

Country of ref document: EP

Kind code of ref document: A1