EP4347891A1 - Biomarkers and methods for classifying subjects following viral exposure - Google Patents

Biomarkers and methods for classifying subjects following viral exposure

Info

Publication number
EP4347891A1
EP4347891A1 EP22731779.9A EP22731779A EP4347891A1 EP 4347891 A1 EP4347891 A1 EP 4347891A1 EP 22731779 A EP22731779 A EP 22731779A EP 4347891 A1 EP4347891 A1 EP 4347891A1
Authority
EP
European Patent Office
Prior art keywords
disease
biomarker
panel
gene
genes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP22731779.9A
Other languages
German (de)
French (fr)
Inventor
Alexander James MANN
Gareth GUENIGAULT
Aruna BASAL
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Poolbeg Pharma UK Ltd
Original Assignee
Poolbeg Pharma UK Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Poolbeg Pharma UK Ltd filed Critical Poolbeg Pharma UK Ltd
Publication of EP4347891A1 publication Critical patent/EP4347891A1/en
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/569Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
    • G01N33/56983Viruses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/118Prognosis of disease development
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Definitions

  • the present invention relates to biomarkers for predicting whether a subject will develop acute symptoms or signs of disease following exposine, or possible exposure, to a respiratory virus such, for example, as an influenza virus.
  • a respiratory virus such, for example, as an influenza virus.
  • the present invention provides methods for predicting whether a subject will develop a severe or complicated form of disease.
  • the invention includes methods of conducting clinical trials or field studies comprising analysing the biomarkers, but more generally, the biomarkers of the invention may be used in any healthcare or non-healthcare setting; for example to triage patients infected with a respiratory virus to identify those who are susceptible to developing acute signs or symptoms and may therefore require medical intervention.
  • Subjects may have been administered a medicinal product for treatment or prevention of respiratory disease, and the biomarkers of the invention may therefore be used as a companion analytical product to predict the likely efficacy of the medicinal product.
  • the present invention further provides computer programmes, computer readable media, computer implemented-methods and classification algorithms that generate or utilise the biomarkers of the invention.
  • RNA viruses such as respiratory syncytial virus (RSV), influenza virus, parainfluenza virus, metapneumovirus, rhinovirus (HRV) and coronavirus (Hodinka, “Respiratory RNA Viruses”, Microbiol Spectr., 2016 Aug; 4(4)).
  • Influenza infects all age groups and causes a range of outcomes from asymptomatic infection and mild respiratory disease through to severe respiratory disease and even death.
  • different subjects exposed to the same influenza virus may be asymptomatic, mildly symptomatic, subclinical, exhibit acute symptoms, or require medical attention, or even urgent hospitalization (Cox et al 1999).
  • the proportion of infections that are asymptomatic or subclinical, and the degree to which these are contagious, as well as the proportion of shedding which occurs prior to onset of symptoms affect the potential impact of control measures and decisions regarding treatment and the administration of medicaments (Lau et al., 2010).
  • Woods et al., “A Host Transcriptional Signature for Presymptomatic Detection of Infection in Humans Exposed to Influenza H1N1 or H3N2”, PLOS ONE, January 2013; 8(1): e52198 describe the generation of a viral gene signature (or factor) for symptomatic influenza that is capable of detecting 94% of infected cases.
  • Woods et al. discloses methods for identifying a subject infected with a respiratory virus prior to presentation of symptoms, such methods do not predict whether or not an individual will develop acute symptoms of an influenza-like disease.
  • the present invention provides a biomarker for predicting whether a subject will develop acute symptoms of disease after exposure, or possible exposine, to a respiratory virus, wherein the biomarker comprises or is derived from expression levels of one or more genes selected from a gene panel comprising PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, H0MER3, NSUN6, EPHA4 and BMP2K measured in a biological sample obtained from the subject after exposure, or possible exposure, to a respiratory virus.
  • a gene panel comprising PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, H0MER3, NSUN6, EPHA4 and BMP2K measured in a biological sample obtained from the subject after exposure, or possible exposure, to a respiratory virus.
  • a biomarker in the context of the present invention is a measurable indicator of biological state or condition, in particular, an output to predict whether a subject will develop acute symptoms of disease.
  • the output may be a numerical output.
  • the biomarker of the invention may be a composite biomarker comprising expression levels of two or more genes of the gene panel or the expression level of at least one gene of the gene panel in combination with at least one other factor as described herein.
  • the subject may be a human or non-human mammal.
  • the acute symptoms of disease may consist of symptoms of an influenza-like or other respiratory disease, as disclosed herein.
  • Acute symptoms of an influenza-like or other respiratory disease means that the subject experiences four or more of the following symptoms and these symptoms, either individually or combined, interfere with daily activities.
  • the symptoms include runny nose, stuffy nose, sore throat, sneezing, earache, cough, shortness of breath, wheezing, chest tightness, headache, malaise, myalgia, muscle and/or joint aches, elevated temperature, chilliness, and feverishness.
  • the elevated temperature may be a temperature of 38 °C or more, optionally experienced together with a cough, optionally with onset within the last 8-12 (e.g. 10) days.
  • a subset of subjects will have acute symptoms, these subject’s symptoms will score in the 85 th percentile, for example the subject’s total VAS or CAT score will be in the 85 th percentile.
  • a total VAS score of greater than or equal to 25 units, or a CAT score greater than or equal to 10 units equates to acute symptoms.
  • a subject may be identified as susceptible to progression to a complicated form of an influenza-like or other respiratory disease.
  • a complicated respiratory disease such as influenza is defined as disease requiring hospital admission and/or with symptoms and signs of lower respiratory tract infection (hypoxaemia, dyspnoea, lung infiltrate), central nervous system involvement and/or a significant exacerbation of an underlying medical condition.
  • a subject is predicted to develop acute symptoms of an influenza-like or other respiratory disease, such as complicated flu or other respiratory disease, then in accordance with the present invention, it is predicted that the subject will go on to exhibit acute symptoms as defined above. It may be predicted that a subject will develop acute symptoms of an influenza-like or other respiratory disease, but the subject may self-resolve, or action may be taken to prevent the subject developing acute symptoms; for example a medicament may be administered. Thus, a subject predicted to develop acute symptoms of an influenza-like or other respiratory disease does not inevitably develop acute symptoms of an influenza- like or other respiratory disease.
  • Respiratory virus includes all viral infections of the respiratory tract including respiratory syncytial virus (RSV), parainfluenza virus (HPIV), metapneumovirus (HMPV), rhinovirus (HRV), coronavirus, adenovirus (HAdV), enterovirus (EV), bocavirus (HBoV), parechovirus (HPeV), influenza including influenza A and influenza B.
  • RSV respiratory syncytial virus
  • HPIV parainfluenza virus
  • HPIV metapneumovirus
  • HRV coronavirus
  • HdV adenovirus
  • EV enterovirus
  • bocavirus bocavirus
  • parechovirus HPeV
  • influenza including influenza A and influenza B.
  • a gene panel in the context of the present invention is a set of genes the expression levels of which can be analysed and used to predict the progression and/or outcome of an influenza-like or other respiratory disease in a subject.
  • a gene sub-panel is a set of genes selected from a gene panel which may be used to predict at a certain stage in the progression of the disease, for example early, mid or late stage progression towards possibly developing acute symptoms of an influenza like or other respiratory disease, or at a certain time point following inoculation with, or exposine to, a respiratory virus, for example up to 25 hours (for example 13-25 hours), or 37-49 hours, or 49-61 hours, in some embodiments these time frames may be referred to as early, mid or late stage respectively.
  • it may not be possible to determine when a subject was exposed to a respiratory virus in addition some subjects develop symptoms quicker than other subjects, and so disease progression and stage of disease progression may be estimated based on evaluated symptoms.
  • the gene panel of the present invention may include one or more genes (including two genes, three genes, four genes, five genes, six genes, etc.) selected from PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
  • the gene panel may consist of up to 16 genes (typically up to 10 genes, and more typically up to 6 genes) including one or more genes (including two genes, three genes, four genes, five genes, six genes, etc.) selected from PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
  • genes including two genes, three genes, four genes, five genes, six genes, etc.
  • the gene panel may comprise PHF20. In some embodiments, the gene panel may comprise NOL9. In some embodiments, the gene panel may comprise both PHF20 and NOL9.
  • the gene panel of the present invention may consist of one, two, three, four, five or six genes selected from PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
  • the gene panel of the present invention may consist of one, two, three, four, five or six genes, including PHF20 and optionally NOL9.
  • a first gene sub-panel may comprise PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2.
  • the first gene sub-panel may comprise the expression level of PHF20.
  • the first gene sub-panel may comprise the expression level of one or both of APBA2 and ABCA1.
  • the first gene sub-panel may additionally comprise the expression level of one, two or three of M0RC2, SNU13 and DCUN1D2.
  • the first gene sub-panel may consist of one, two, three, four, five, or six of PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2.
  • the first gene sub-panel may consist of one gene, which is PHF20.
  • the first gene sub-panel may consist of two genes, one of which is PHF20.
  • the first gene sub-panel may consist of three genes, one of which is PHF20.
  • the first gene sub-panel may consist of four genes, one of which is PHF20.
  • the first gene sub-panel may consist of five genes, one of which is PHF20.
  • the first gene sub-panel may consist of six genes, one of which is PHF20.
  • the first gene sub-panel may consist of two genes, one of which is PHF20, and the other of which is APBA2 or ABCA1.
  • the first gene sub-panel may consist of three genes, including PHF20, accompanied by one or both of APBA2 and ABCA1.
  • the first gene sub-panel may consist of four genes, including PHF20, accompanied by one or both of APBA2 and ABCA1.
  • the first gene sub-panel may consist of five genes, including PHF20, accompanied by one or both of APBA2 and ABCA1.
  • the first gene sub-panel may consist of six genes, including PHF20, accompanied by one or both of APBA2 and ABCA1.
  • a second gene sub-panel may comprise MAX, NOL9, MPRIP, HP, BST1 and TM9SF2.
  • the second gene sub-panel may comprise the expression level of one or more of NOL9, HP and MAX (particularly NOL9).
  • the second gene sub-panel may comprise the expression level of one or both of BST1 and MPRIP.
  • the second gene sub-panel may additionally comprise the expression level of TM9SF2.
  • the second gene sub-panel may consist of one, two, three, four, five, or six of MAX, NOL9, MPRIP, HP, BST1 and TM9SF2.
  • the second gene sub-panel may consist of one gene, which is NOL9, HP or MAX (particularly NOL9).
  • the second gene sub-panel may consist of two genes, one or both of which are selected from NOL9, HP and MAX (particularly NOL9).
  • the second gene sub-panel may consist of three genes, one, two or all of which are selected from NOL9, HP and MAX (particularly NOL9).
  • the second gene sub-panel may consist of four genes, one, two or three of which are selected from NOL9, HP and MAX (particularly NOL9).
  • the second gene sub-panel may consist of five genes, one, two or three of which are selected from NOL9, HP and MAX (particularly NOL9).
  • the second gene sub-panel may consist of six genes, one, two or three of which are selected from NOL9, HP and MAX (particularly NOL9).
  • the second gene sub-panel may consist of two genes, one of which is NOL9, HP or MAX (particularly NOL9) and the other of which is BST1 or MPRIP.
  • the second gene sub-panel may consist of three genes, including one or more of NOL9, HP or MAX (particularly NOL9), accompanied by one or both of BST1 and MPRIP.
  • the second gene sub-panel may consist of four genes, including one or more of NOL9, HP or MAX (particularly NOL9), accompanied by one or both of BST1 and MPRIP.
  • the second gene sub-panel may consist of five genes, including one or more of NOL9, HP or MAX (particularly NOL9), accompanied by one or both of BST1 and MPRIP.
  • the second gene sub-panel may consist of six genes, including one or more of NOL9, HP or MAX (particularly NOL9), accompanied by one or both of BST1 and MPRIP.
  • a third gene sub-panel may comprise HOMER3, NSUN6, HP, EPHA4 and BMP2K.
  • the third gene sub-panel may comprise the expression level of one or both of HP and HOMER3.
  • the third gene sub-panel may comprise the expression level of one or both of EPHA4 and BMP2K.
  • the third gene sub-panel may additionally comprise the expression level ofNSUN6.
  • the third gene sub-panel may consist of one, two, three, four or five of H0MER3, NSUN6, HP, EPHA4 and BMP2K.
  • the third gene sub-panel may consist of one gene, which is HP or H0MER3.
  • the third gene sub-panel may consist of two genes, one or both of which are selected from HP and HOMER3.
  • the third gene sub-panel may consist of three genes, one or two of which are selected from HP and HOMER3.
  • the third gene sub-panel may consist of four genes, one or two of which are selected from HP and H0MER3.
  • the third gene sub-panel may consist of five genes, one or two of which are selected from HP and HOMER3.
  • the third gene sub-panel may consist of two genes, one of which is HP or H0MER3; and the other of which is EPHA4 or BMP2K.
  • the third gene sub-panel may consist of three genes, including one or both of HP and H0MER3, accompanied by one or both of EPHA4 and BMP2K.
  • the third gene sub- panel may consist of four genes, including one or both of HP and H0MER3, accompanied by one or both of EPHA4 and BMP2K.
  • the third gene sub-panel may consist of five genes, including one or both of HP and H0MER3, accompanied by one or both of EPHA4 and BMP2K.
  • any of the aforementioned gene panels may further comprise 1 to 2 genes in addition to those listed above, without departing from the essential character of the panels of the present disclosure.
  • the genes that have been identified as being predictive of a subject developing acute symptoms of an influenza like or other respiratory disease exhibit altered expression levels following inoculation with a virus in subjects who then go on to exhibit acute symptoms relative to those who do not develop acute symptoms, as defined above. This indicates the potential of the genes to identify subjects who are more likely to develop acute symptoms of an influenza-like or other respiratory disease. Since symptoms of viral infection develop sooner in some subjects than in others, altered expression of the one or more genes according to the present invention may be predictive of acute symptoms, before a subject shows any symptoms of infection, or an early diagnostic indicator of acute symptoms at about the same time as the subject starts to show the first symptoms of infection.
  • the expression levels of the one or more genes in the biological sample may be measured using any suitable method known in the art for quantifying the expression level of a gene, particularly a mammahan gene.
  • the expression level of the one or more genes may be measured by quantifying mRNA transcripts of the one or more genes according to the invention in the biological sample.
  • a PCR-based method may be used such, for example, as RT-qPCR.
  • RT-qPCR-based methods are disclosed by United States patent no. 7,101,663, the contents of which are incorporated herein by reference.
  • An advantage of real-time PCR is its relative ease and convenience of use.
  • a gene expression microarray may be used of the kind disclosed in, for example, United States patent no. 6,040,138, the contents of which are incorporated herein by reference, in which a pool of labelled target cRNA molecules, which are obtained by transcribing double-stranded cDNA derived from the mRNA transcripts that are isolated from the biological sample and fragmenting the resulting cRNA transcripts, are hybridised to oligonucleotide probes having specific sequences that are immobilised at specific addresses on a solid support. After incubating the cRNA targets with the surface- bound probes, the arrays are washed and the labels on the targets may be used to quantify how much target is bound to any given feature on the array. The amount of a given surface-bound target cRNA is proportional to the expression level of the corresponding gene.
  • RNA-seq may be used to quantify, discover and profile RNAs. This uses next- generation sequencing on cDNA converted from RNA (Wang et al 2009).
  • the biological sample may be a blood or a respiratory sample.
  • the sample may be a sample containing immune cells.
  • the expression level of each of the one or more genes may be compared with a respective reference level.
  • the reference level may be a threshold expression level that indicates acute symptoms of an influenza -like or other respiratory disease or a prediction of acute symptoms developing.
  • the reference level may be a baseline level of expression which indicates that the subject is unlikely to develop acute symptoms of an influenza-like or other respiratory disease.
  • the method may involve an individual reference level for each gene. Altered expression of at least one of the genes, preferably two or more of the genes, relative to their respective reference levels may indicate acute symptoms of an influenza like or other respiratory disease or predict developing acute symptoms of an influenza like or other respiratory disease in accordance with the present invention.
  • the reference level for the, or each, gene may be a previously measured expression level for the gene in the same subject.
  • the reference level for the, or each, gene may comprise a baseline expression level of the gene for the subject which is measured at a time when the subject is known not to be infected with a respiratory virus such, for example, as influenza.
  • a respiratory virus such, for example, as influenza.
  • the reference level for each gene may comprise an average of multiple previous levels.
  • a subject may be tested once to obtain baseline levels for the one or more genes, which form reference levels that may be used subsequently in case of suspected viral infection or a routine check, for comparison with contemporaneous expression levels to predict whether or not the subject is likely to develop acute symptoms of an influenza like or other respiratory disease.
  • Exposure to a respiratory virus includes any contact or possible contact with a respiratory virus including, exposure to a community acquired respiratory viral infection, exposure to respiratory virus at home, within a care home, hospital or military setting. Exposure also includes inoculation of subjects during a human challenge model and/or clinical trial.
  • a single gene panel containing one or more genes, may be analysed using a first algorithm.
  • a combination of gene panels and gene-sub panels may be analysed.
  • the different gene panels and gene sub-panels may be analysed simultaneously, for example using the same biological sample or samples obtained within a similar, or the same, time fame.
  • the different gene panels and gene sub-panels may be analysed sequentially with a first gene panel or gene sub-panel analysed in a sample taken at a first time point using a first algorithm, and a second gene panel or gene sub-panel analysed in a sample taken at a second time point using a second algorithm, and a third gene panel or gene sub-panel analysed in a sample taken at a third time point using a third algorithm etc.
  • the biomarker of the invention may be based on a number of input variables or factors, including gene expression levels, for example the age of the subject, or other underlying conditions that a subject may suffer e.g. asthma, may be included in the variables used to calculate the biomarker.
  • the biomarker may therefore be a composite biomarker.
  • the output of the biomarker may be a numerical value.
  • the numerical value may be determined using a threshold, reference level, or baseline level, for example a numerical value above a certain reference level predicts that a subject will develop acute symptoms of an influenza-like or other respiratory disease.
  • the biomarker may be computer-generated and comprises an output variable of a classification algorithm that uses as input variables the expression levels of one or more genes in the gene panel; or one or more genes in the first gene sub-panel; or one or more genes in the second gene sub-panel; or one or more genes in the third gene sub-panel.
  • the classification algorithm may be configured to prioritise accuracy such that the algorithm produces the greatest number of correct predictions.
  • the classification algorithm may be configured to prioritise Negative Predictive Value (NPV), the proportion of negative test results that are true negatives, the aim being to minimise the number of subjects predicted not to develop acute symptoms of influenza-like disease who in fact go on to develop acute symptoms of an influenza-like or other respiratory disease.
  • NPV Prioritise Negative Predictive Value
  • the classification algorithm may be derived by machine-learning fiom a training data-set that uses as input variables expression levels of one or more genes fiom the gene panel measured fiom a biological sample obtained fiom a group of subjects at a predetermined time after exposure to the respiratory virus, wherein the group of subjects is divided into two classes according to whether or not they developed acute symptoms of an influenza-like or other respiratory disease after exposure to the respiratory virus, and wherein the classification algorithm operates on the expression levels to produce an output variable that differentiates between the classes.
  • classification algorithms are available to those skilled in the art for classifying subjects into two or more classes based on their symptoms scores.
  • numerous machine learning techniques are available for using a training dataset comprising the two or more classes and their respective expression levels for the one or more genes to derive a classification algorithm that is able to classify a new subject based on their expression levels of the one or more genes.
  • the performance of a classification algorithm built using a machine learning process may be validated using one or more known validation methods, e.g. cross-validation, and calculating statistical parameters (e.g. accuracy, sensitivity, specificity) so that the person skilled in the art can obtain a classification algorithm that is best suited for classifying subjects based on their expression levels of the one or more genes.
  • the acute symptoms of an influenza-like or other respiratory disease in the subjects of the group in the training data set may be assessed by evaluating one or more symptoms of influenza-like or other respiratory disease at a series of pre-set times after exposine to the respiratory virus.
  • the one or more symptoms are evaluated by the subjects using diary cards, optionally visual analogue scare symptom diary cards (VAS), or optionally categorical symptoms (CAT) are recorded using a modified standardized symptom score for example the modified Jackson Score.
  • the symptoms evaluated may include runny nose, stuffy nose, sore throat, sneezing, earache, cough, shortness of breath, headache, malaise, myalgia, muscle and/or joint aches, chilliness, and feverishness.
  • the first class of subjects may record a total VAS of greater than or equal to 25 units and/or a total CAT score of 10 units or greater, or may show one or more of: greatest variance in total VAS or CAT up to the peak of symptoms; greatest variance in total VAS or CAT over the duration of quarantine; or steepest gradient (slope of regression line) of total VAS or CAT up to the peak of symptoms.
  • machine learning processes and the resulting classification algorithms may be carried out using a computer.
  • the gene panels and gene sub-panels may be selected by i) analysing expression levels in biological samples obtained from the group of subjects in the data training set across the whole series of pre-set times after exposure to the virus; and ii) identifying genes that show a nominal association with acute symptoms of an influenza-like or other respiratory disease, and iii) using a variable selection process to select panels of the identified genes whose expression levels at a predetermined time after exposure to the virus exhibit maximal predictive value for developing acute symptoms of an influenza- like or other respiratory disease.
  • the variable selection process may comprise subjecting the expression levels of the identified genes at the predetermined time after exposure to the respiratory virus to a repeated gradient boosting process and selecting a set of 1, 2, 3, 4, 5 or 6 genes that are selected most frequently by the gradient boosting process.
  • a variable selection process is illustrated in FIG. 6 of the accompanying drawings and is performed by a gradient boosting machine (GBM ; Friedman 2001; Friedman 2002).
  • GBM gradient boosting machine
  • differential expression analysis of genes between subjects that developed acute symptoms of influenza-like or other respiratory disease and subjects that did not develop acute symptoms of influenza-like or other respiratory disease was performed by application of a cubic p-spline model.
  • Nominal associations arising from the cubic spline analysis were input into a variable selection process comprising gradient boosting machine, and iterative searches were conducted using fifty starting point (seeds), to determine the best gene predictors of developing acute symptoms of influenza-like or other respiratory disease.
  • the biomarker may be used to allocate subjects to groups in a clinical trial or to make treatment decisions. Subjects allocated to one subgroup are administered a medicament while those allocated to another subgroup are not administered a medicament or do not receive a medicament until later in the trial or study.
  • the biomarker may also be used to monitor the efficacy of a medicament by assessing whether a subject to whom the medicament has been administered is likely to develop acute symptoms of disease after exposure, or possible exposure, to a respiratory virus.
  • the medicament may comprise a therapeutic agent or a preventative agent such, for example, as a vaccine.
  • the present invention provides a method of predicting whether a subject will develop acute or complicated symptoms of disease after exposure, or possible exposure, to a respiratory virus which comprises analysing a biomarker according to the invention and comparing the biomarker to a reference for the biomarker.
  • the present invention also provides a method of conducting a clinical trial or field study in which a group of subjects are exposed to a respiratory virus, the method comprising analysing a biomarker according to the invention for each subject and comparing the biomarker to a reference for the biomarker to predict whether the subject is likely to develop acute symptoms of disease, and including subjects who are predicted to develop acute symptoms of disease in a first subgroup of the clinical trial or field study and including subjects who are predicted not to develop acute symptoms of disease in a second subgroup.
  • the methods of the invention may include comparing the biomarker to a reference for the biomarker or to a baseline for the biomarker.
  • the baseline for the biomarker may be determined at a time when the subject is known not to be infected with a respiratory virus.
  • the disease may be an influenza-like or other respiratory disease.
  • a medicament includes all substances used for medical treatment and includes vaccines, drugs, placebos and investigational medicaments, for example investigational medicaments that are the subject of a clinical trial. Therefore medicament includes licensed, unlicensed and investigational medicaments. Medicament also includes products that already have a marketing authorisation but that are being tested for a different use, or for efficacy when assembled in a different way, or tested to gain further information about the authorised use.
  • the invention also provides a computer program for predicting whether a subject will develop acute symptoms of disease such, for example, as an influenza-like disease after exposure, or possible exposure, to a respiratory virus, which comprises instractions which, when the program is executed by a computer, cause the computer to generate a biomarker according to the invention.
  • the invention further provides a classification algorithm for predicting whether a subject will develop acute symptoms of disease such for example as an influenza-like or other respiratory disease after exposure, or possible exposure, to a respiratory virus, wherein the classification algorithm is derived by analysing expression levels of one or more genes in subjects who have developed acute symptoms of disease and comparing with the expression levels in subjects who do not develop acute symptoms of disease, wherein the one or more genes are PHF20, AB CAI, APBA2, M0RC2, SNU13 DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
  • the classification algorithm is derived by analysing expression levels of one or more genes in subjects who have developed acute symptoms of disease and comparing with the expression levels in subjects who do not develop acute symptoms of disease, wherein the one or more genes are PHF20, AB CAI, APBA2, M0RC2, SNU13 DCUN1D2, MAX, NOL9, MPRIP, HP, BST1,
  • the invention further provides a computer readable medium and/or computer program comprising instructions which, when executed by a computer, cause the computer to carry out the classification algorithm according to the invention.
  • the invention also provides a computer-implemented method for predicting whether a subject will develop acute symptoms of disease such for example as an influenza-like disease, wherein a biomarker is generated by analysing expression levels of one or more genes in subjects who have developed acute symptoms of disease following inoculation with a respiratory virus and comparing with the expression levels in subjects who do not develop acute symptoms of disease following inoculation with a respiratory virus, wherein the one or more genes are PHF20, ABCA1, APBA2, MORC2, SNU13 DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
  • a biomarker is generated by analysing expression levels of one or more genes in subjects who have developed acute symptoms of disease following inoculation with a respiratory virus and comparing with the expression levels in subjects who do not develop acute symptoms of disease following inoculation with a respiratory virus, wherein the one or more genes are PHF20, ABCA1, APBA2, MORC
  • the invention also provides a computer-implemented method, wherein the method comprises a graphical user interface which displays the biomarker to the user. It is also contemplated that the computer-implemented aspects of the invention may be carried out by more than one computer e.g. two or more computer operating in different locations. Two or more computers may communication via a data channel, for example the internet.
  • the invention also provides a method of predicting whether a subject will develop acute symptoms of disease after exposure, or possible exposure, to a respiratory virus, which comprises estimating time elapsed after the exposure, or possible exposure, to the respiratory virus by analysing expression levels of one or more genes selected from PHF20, AB CAI, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, H0MER3, NSUN6, EPHA4, in a biological sample obtained from the subject; selecting a biomarker described herein, which at said time exhibits maximal predictive value for developing acute symptoms of disease; and comparing the biomarker to a reference for the biomarker.
  • the time elapsed may be estimated to be about a day (i.e. about 23-26 hours, e.g. 25 hours) after exposine, or possible exposure, to the respiratory virus.
  • the selected biomarker may comprise expression levels of one or more genes from the first gene sub-panel described herein.
  • the time elapsed may be estimated to be about 1.5-2 days (i.e. about 37-49 hours) after exposure, or possible exposure, to the respiratory virus. If so, the selected biomarker may comprise expression levels of one or more genes from the second gene sub-panel described herein.
  • the time elapsed may be estimated to be he about 2-2.5 days (i.e. about 49-61 hours) after exposure, or possible exposure, to the respiratory virus. If so, the selected biomarker may comprise expression levels of one or more genes from the third gene sub-panel described herein.
  • the invention also provides a kit for use in a method according to the invention.
  • the kit may comprise one or more reagents that allow detection, optionally quantitation, of one or more nucleotides, or one or more peptides, corresponding to one or more genes from the gene panel or the first, second or third gene sub-panel described herein.
  • the kit may be for detection of one or more analytes in, or extracted from, a biological sample, such as, but not limited to, a blood, serum, plasma, mine, saliva, tissue biopsy, stool, sputum, skin, nose or throat sample.
  • the kit may comprise a device for conducting an assay, such as a lateral flow assay.
  • the device may be configured for autonomous use by a patient (without assistance from a physician), for example in the home (as opposed to a hospital or other medical facility).
  • the device may comprise a strip of porous material, which is capable of supporting capillary flow, wherein there is a zone for receiving a sample; a zone comprising a reagent for detection of an analyte; a detection zone; and a control zone.
  • a reaction between the reagent and the analyte may be detected in the detection zone, for example by a change in colour of material in the detection zone.
  • the control zone may serve as a reference against which to benchmark the reaction detected in the detection zone.
  • the device may comprise more than one test strip, for example the device may comprise two, three, four, five or six test strips, in communication with the same or separate receiving zones.
  • FIG. 1 is a chart showing how a subject is processed through a typical clinical study as described in Example 1 below.
  • FIG. 2 is a graph showing peak level of Total VAS for infected (left) and non-infected (right) subjects.
  • FIG. 3 is a graph showing change in maximum variance of VAS scores demonstrating that the four individuals with acute symptoms of influenza-like disease at peak all experienced a change in variance of Total VAS greater than 30 units.
  • FIG. 4 is a graph showing peak level of peak categorical scores for infected (left) and non- infected (right) subjects across three studies (hVIVO-l, Duke-1, Duke-2).
  • FIG. 5 is a principal component analysis to show greater homogeneity after imposing adjustment for study.
  • FIG. 6 is a flow chart to demonstrate the variable selection process performed by gradient boosting machines.
  • FIG. 7 is a flow diagram to demonstrate scenarios in which 1 to 3 of the algorithms are run in parallel to assign subjects to groups for actioning (e.g. dosing with a medicament, additional clinical assessments).
  • the scenarios include: within the human viral challenge model, field study, or in the community, wherein subjects are exposed to a respiratory virus.
  • FIG. 8 is a flow diagram to demonstrate scenarios in which each gene algorithm is used sequentially to assign subjects to groups for actioning (e.g. dosing with a medicament, additional clinical assessments) within the human viral challenge model.
  • AfiymetrixTM HG-U133 Plus 2.0 microarray across the whole quarantine post inoculation were used to perform transcriptomics analysis.
  • Differential expression analysis between subjects developing acute symptoms of an influenza-like disease and subjects that did not develop acute symptoms of an influenza-like disease was performed by application of a cubic p-spline model. Nominal associations arising from the cubic spline analysis were input into a variable selection process to determine the best six gene predictors of acute symptoms of influenza-like disease at Day 1 morning, Day 2 morning, and Day 2 evening after inoculation. These genes could be used to distinguish between subjects that developed acute symptoms of an influenza-like disease and subjects that did not develop acute symptoms of an influenza- like disease within the model at different times after exposine to virus.
  • influenza A was instilled into bilateral nares of subjects using standard pipetting methods.
  • the volunteers had clinical measurements and samples taken until discharged from quarantine and then at each follow up visit.
  • hVIVO-1 33 of the 60 subjects became infected after inoculation (evidenced by confirmed viral shedding), 25 were identified as not infected and 2 inconclusive. An interim analysis was performed after the first 27 were inoculated and all samples for each subject were sent for gene microarray assays. One of the 27 subjects did not complete the quarantine and so was excluded from analysis. Of the 26 subjects with viable microarray data, 13 were identified as confirmed as infected and 11 as not infected, 2 were inconclusive.
  • Subjects self-assessed their symptoms three times daily throughout quarantine on both categorical and continuous (Visual Analogue Scale, VAS) symptom diary cards.
  • Categorical symptoms were recorded using a modified standardized symptom score.
  • the modified Jackson Score requires subjects to rank 10 symptoms consisting of: upper respiratory tract symptoms (runny nose, stuffy nose, sore throat, sneezing, and earache), lower respiratory symptoms (cough and shortness of breath) and systemic symptoms (headache, myalgia, and muscle and/or joint aches) on a scale of 0-3 of “no symptoms”, “just noticeable”, “bothersome but can still do activities” and “bothersome and cannot do daily activities”.
  • RNA PAXGeneTM collection tubes were collected into RNA PAXGeneTM collection tubes. This occurred once on Day -1, in the morning on the day of inoculation (approximately 5 hours before inoculation) followed by every 12 hours for hVIVO-1 and Duke-2 and every 8 horns for Duke-1 for the remainder of the quarantine.
  • PCA Principal Components Analysis
  • FIG. 6 shows the process that was followed.
  • Table 3 shows the variables selected by gradient boosting
  • Table 4 shows the signature arising from logistic regression and lastly
  • Table 5 shows the test performance characteristics at all time- points considered. It can be seen that the signature, or gene sub-panel, performs well (AUG > 0.80) on Day 1 data.
  • This signature includes the genes PHF20, ABCA1, APBA2, M0RC2, SNU13, and DCUN1D2.
  • Table 6 shows the variables selected by gradient boosting
  • Table 7 shows the signature arising from logistic regression and lastly
  • Table 8 shows the test performance characteristics at all time- points considered. The signature performs well at Day 2 (AM).
  • This signature, or gene sub-panel includes the genes MAX, N0L9, MPRIP, HP, BST1,
  • Table 9 shows the variables selected by gradient boosting
  • Table 10 shows the signature arising from logistic regression and lastly
  • Table 11 shows the test performance characteristics at all time- points considered.
  • This signature includes the genes H0MER3, NSUN6, HP, EPHA4,
  • a group of individuals are recruited into a human viral challenge model and inoculated with a respiratory virus. It is beneficial to identify, in advance, subjects who will progress to develop acute symptoms of an influenza like disease, allowing selective dosing of these subjects with an investigational or licensed medicament (drug/vaccine/ placebo) at the earliest opportunity. This will improve the ability to detect a clinically relevant reduction in disease in response to the medicament by only evaluating the medicament effects in individuals that will/would have developed acute symptoms of an influenza-like disease. This will also reduce unnecessary exposure of subjects to an investigational medicament. This will also reduce the amount of medicament required.
  • Volunteers are screened for eligibility for the evaluation of efficacy of an investigational medicament in a human challenge study with a respiratory virus, in particular with influenza.
  • Eligible volunteers arrive at the clinic and baseline samples and clinical measures are taken, before they are exposed to virus (e.g. inoculation). Baseline values are obtained pre-inoculation using one or more blood samples over varying time-points.
  • Blood samples are taken regularly before and after virus exposure (e.g. paxgene RNA samples twice, three times a day, or more) alongside clinical measures of their disease.
  • virus exposure e.g. paxgene RNA samples twice, three times a day, or more
  • Expression levels of specific gene panels and sub-panels are measured in real-time from the blood paxgenes.
  • blood is assessed for gene expression utilising Affymetrix HG- U133 Plus 2.0 microarray chips, which were used to measure the transcripts’ expression.
  • Microarray data was pre-processed using RMA background correction and quantiles normalization.
  • Three separate gene subpanels can be used (i.e. 3 algorithms) to identify which individuals will develop acute symptoms of an influenza-like disease a.
  • the 3 gene sub panels are: i.
  • Subpanel A PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2 ii.
  • Subpanel B MAX, NOL9, MPRIP, HP, BST1 and TM9SF2 iii.
  • Subpanel C H0MER3, NSUN6, HP, EPHA4 and BMP2K
  • the different gene subpanels are used at different time points (FIG. 8, sub panel A, followed by sub panel B, followed by subpanel C).
  • subpanels sequentially 1, 2, or 3 gene sub panels are used at the same time (FIG. 7), and may be repeated at several points following inoculation.
  • a positive result for any test immediately triggers dosing of the subject with the investigational medicament (drug/vaccine/placebo).
  • two positive results, or three positive results, are required to trigger dosing.
  • the stringency can be varied by modifying the threshold at which a positive result is obtained.
  • the threshold for gene sub-panel 1 could be set to be more stringent avoiding false positives.
  • Gene sub-panel 2 is then set with lower stringency and gene sub- panel 3 with even lower stringency, thus increasing the chances of identifying and dosing all subjects who will develop acute symptoms of influenza-like disease as early as possible.
  • results may be combined with a diagnostic test that confirms the subject has the respiratory viral infection relevant to the trial (e.g. a viral test).
  • a diagnostic test that confirms the subject has the respiratory viral infection relevant to the trial (e.g. a viral test).
  • results may be combined with measurements of the change in variance/gradient of symptoms.
  • Other actions that can be triggered alongside dosing with a medicament include increasing the observations/samples/measurements in those predicted to develop acute symptoms of an influenza-like disease or reducing observations/samples/measurements in those who are predicted not to develop acute symptoms of an influenza-like disease.
  • the subjects may be dosed at a predetermined time point post exposine or inoculation (e.g. Day 4). These subjects form a further subgroup for analysis.
  • a group of individuals are recruited into an efficacy field study and become infected in the community with a respiratory virus, in particular influenza. Following exposure, it would be beneficial to identify, in advance, subjects who will develop acute symptoms of an influenza like disease, allowing selective dosing of these subjects with an investigational or licensed medicament (drug/vaccine/ placebo) at the earliest opportunity. This would improve ability to detect clinically relevant reduction in disease by only evaluating the medicament effects in individuals that would have gone on to present with acute symptoms of an influenza-like disease. This will reduce unnecessary exposine of subjects to an investigational medicament as well as reducing the amount of medicament required.
  • Eligible volunteers arrive at the clinic and baseline samples and clinical measures are taken as they are enrolled in the study. Baseline values would be obtained using one or more blood samples over varying time-points post enrolment and prior to contracting a respiratory virus infection in the community.
  • Blood samples e.g. paxgene RNA samples twice, three times a day, or more
  • blood samples are taken after virus exposure from a household contact that has a respiratory infection, or after showing initial symptoms of respiratory disease (the trial subjects may or may not be using a study questionnaire/symptom diary card that captures these symptoms).
  • Specific gene sub-panels are measured real-time in the blood paxgenes using the methods described in Examples 1 and 2.
  • Three separate gene sub-panels can be used (i.e. 3 algorithms) to identify which individuals will progress to have acute symptoms of an influenza-like disease a.
  • the 3 subpanels of genes are: i.
  • Subpanel A PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2 ii.
  • Subpanel B MAX, NOL9, MPRIP, HP, BST1 and TM9SF2 iii.
  • Subpanel C HOMER3, NSUN6, HP, EPHA4 and BMP2K
  • the stringency can vary by modifying the threshold at which a positive result is obtained.
  • the threshold for gene sub-panel 1 could be set to be more stringent avoiding false positives.
  • Gene sub-panel 2 is then set with lower stringency and gene sub-panel 3 with even lower stringency, thus increasing the chances of identifying and dosing all subjects who will develop acute symptoms of an influenza-like disease as early as possible.
  • results may be combined with a diagnostic test that confirms the subject has the respiratory viral infection relevant to the trial (e.g. viral test).
  • results may be combined with measurements of the change in variance/gradient of symptoms.
  • Other actions that can be triggered alongside dosing with a medicament include increasing the observations/samples/measurements in those predicted to go on develop acute symptoms of influenza-like disease or reducing observations/samples/measurements in those predicted to not develop acute symptoms of an influenza-like disease.
  • the subjects may be dosed at a predetermined time points post exposine (e.g. Day 4, Day 5), with these subjects forming a secondary subgroup analysis.
  • exposine e.g. Day 4, Day 5
  • Subjects may become infected with a respiratory virus in the community or exposed to an infected person for an extended period in the community.
  • Community settings can include at home (family member, household contact), at work, in transit (e.g. on a train, coach, plane, ship), within a care home (fellow resident, family visitor, carer), as an inpatient in hospital (fellow inpatient, healthcare worker, visitor), within military setting (fellow personnel). Following exposure, it would be beneficial to identify, in advance, people who will progress to have acute symptoms of an influenza-like disease, allowing an intervention at the earliest opportunity.
  • Interventions include assisting referral to healthcare professionals, enabling earlier treatment with an antiviral than would otherwise be possible (for example Tamiflu), administration of an immunomodulator drug or combination of antiviral and immunomodulator, separation fiom others (quarantine), inclusion in a study (IMP trial, transmission study), initiate sampling for disease/biomarker monitoring.
  • a subject becomes infected with a respiratory virus in the community or is exposed to an infected person for a prolonged period in the community.
  • Post viral exposure blood samples may be collected and gene levels quantified following one or more triggers: a. a positive diagnostic test (e.g. based on viral replication, diagnostic biomarker) b. initial symptoms of respiratory viral disease c. prolonged exposine to an infected contact
  • Specific gene panels or gene sub-panels are measured real-time in the blood sample (e.g. using a Point of Care test).
  • the absolute value of each gene at a given time-point can be used or alternatively where a baseline gene level was obtained or is available, the gene levels post exposure to virus can be baseline normalised for each subject (i.e. compared to baseline).
  • three separate gene sub-panels are used (i.e. three algorithms) to identify which individuals are likely to develop acute symptoms of an influenza-like disease a.
  • the three subpanels of genes are: i.
  • Subpanel A PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2 ii.
  • Subpanel B MAX, NOL9, MPRIP, HP, BST1 and TM9SF2 iii.
  • Subpanel C HOMER3, NSUN6, HP, EPHA4 and BMP2K
  • a positive result for any test may trigger one or more of the following: i. assisting referral to healthcare professionals, ii. enabling earlier treatment with an antiviral than would otherwise be possible (e.g.Tamiflu), iii. administration of an immunomodulator drug or combination of antiviral and immunomodulator, iv. separation from others (quarantine, using barriers to transmission e.g. masks), v. inclusion in a study (e.g. IMP trial, disease study, transmission study), vi. initiate sampling for disease/biomarker monitoring b.
  • two positive results or three positive results are required to trigger the actions, as listed above.
  • the stringency can vary by modifying the threshold at which a positive result is obtained.
  • the threshold for gene sub-panel 1 could be set to be more stringent avoiding false positives.
  • Gene sub-panel 2 is then set with lower stringency and gene sub- panel 3 with even lower stringency, thus increasing the chances of identifying and intervening as early as possible.
  • results may be combined with a diagnostic test that confirms the subject has the respiratory viral infection (if not already performed).
  • RNA-Seq a revolutionary tool for transcriptomics. Nature Reviews. Genetics, 10(1), 57-63.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Organic Chemistry (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Microbiology (AREA)
  • Hematology (AREA)
  • Urology & Nephrology (AREA)
  • Virology (AREA)
  • Biomedical Technology (AREA)
  • Genetics & Genomics (AREA)
  • Physics & Mathematics (AREA)
  • Zoology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Pathology (AREA)
  • Cell Biology (AREA)
  • General Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Medicinal Chemistry (AREA)
  • Food Science & Technology (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Peptides Or Proteins (AREA)

Abstract

Methods of predicting whether a subject will develop acute symptoms of disease after exposure, or possible exposure, to a respiratory virus, which comprise analysing a biological sample obtained from the subject for a biomarker and comparing the biomarker to a reference for the biomarker, wherein the biomarker comprises or is derived from expression levels of one or more genes selected from a gene panel comprising PHF20, ABCAI, APBA2, MORC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K. Also disclosed are related predictive methods, methods of conducting a clinical trial or field study, computer programs, classification algorithms, computer readable mediums and computer-implemented methods.

Description

BIOMARKERS AND METHODS FOR CLASSIFYING SUBJECTS FOLLOWING VIRAL
EXPOSURE
Field of the Invention
[0001] The present invention relates to biomarkers for predicting whether a subject will develop acute symptoms or signs of disease following exposine, or possible exposure, to a respiratory virus such, for example, as an influenza virus. The present invention provides methods for predicting whether a subject will develop a severe or complicated form of disease. As disclosed herein, the invention includes methods of conducting clinical trials or field studies comprising analysing the biomarkers, but more generally, the biomarkers of the invention may be used in any healthcare or non-healthcare setting; for example to triage patients infected with a respiratory virus to identify those who are susceptible to developing acute signs or symptoms and may therefore require medical intervention. Subjects may have been administered a medicinal product for treatment or prevention of respiratory disease, and the biomarkers of the invention may therefore be used as a companion analytical product to predict the likely efficacy of the medicinal product. The present invention further provides computer programmes, computer readable media, computer implemented-methods and classification algorithms that generate or utilise the biomarkers of the invention.
Background to the Invention
[0002] Acute upper and lower respiratory infections are a major public health problem and a leading cause of morbidity and mortality worldwide. Viruses are the predominant cause of respiratory tract illnesses and include RNA viruses such as respiratory syncytial virus (RSV), influenza virus, parainfluenza virus, metapneumovirus, rhinovirus (HRV) and coronavirus (Hodinka, “Respiratory RNA Viruses”, Microbiol Spectr., 2016 Aug; 4(4)).
[0003] The CDC estimates that in the 2015-2016 period in the US there were 25 million influenza illnesses, 11 million influenza-associated medical visits, 310,000 influenza-related hospitalizations, and 12,000 pneumonia and influenza deaths (Rolfes et al 2016). In 2003 the annual economic burden of influenza in the US alone was estimated to be around 87 billion dollars (Molinari et al 2007). The costs of influenza are clearly substantial and any method to treat or diagnose influenza would be of enormous value.
[0004] Influenza infects all age groups and causes a range of outcomes from asymptomatic infection and mild respiratory disease through to severe respiratory disease and even death. As such, different subjects exposed to the same influenza virus may be asymptomatic, mildly symptomatic, subclinical, exhibit acute symptoms, or require medical attention, or even urgent hospitalization (Cox et al 1999). Further, the proportion of infections that are asymptomatic or subclinical, and the degree to which these are contagious, as well as the proportion of shedding which occurs prior to onset of symptoms, affect the potential impact of control measures and decisions regarding treatment and the administration of medicaments (Lau et al., 2010).
[0005] Current trial designs within the human challenge model for assessing investigational treatment drugs and medicaments for influenza, RSV, or HRV rely on either: a. “universal dosing” - Universally treating all subjects inoculated with virus on a given day post inoculation (e.g. 24 hrs or 28 hrs post inoculation), irrespective of whether the subjects become infected or not; b. "Triggered dosing” - Treating only those subjects when they have either one or both of the following: i. their first (or confirmed) PCR positive respiratory sample (i.e. treating only those that are expected to be infected post inoculation); ii. initial respiratory symptoms that are indicative of onset of viral infection; c. ‘Triggered dosing + Universal dosing” (DeVincenzo et al, NEJM 2014; DeVincenzo et al, NEJM 2015)- This uses the principles of triggered dosing for the primary endpoint, however at a certain day post inoculation e.g. Day 5 any subjects that still don’t have a positive viral sample (or symptoms) are subsequently given the drug anyway. The ones that are a universally given the drug in this scenario may be included for analysis in two sub-analysis approaches: i. On their own as a sub-group ii. Combined with the triggered sub-group
[0006] In research models such as the human challenge model, knowing who will develop significant symptoms in advance, would allow dosing of an investigational medicament to be triggered only in subjects who would otherwise go on to develop acute symptoms of an influenza-like disease. A method capable of predicting who will develop acute symptoms of an influenza-like disease would allow the identification of subjects appropriate for administration of the medicament. The benefits of this volunteer selection method for dosing include: a. Improved ability to detect a clinically relevant reduction in disease by only evaluating the medicament effects in those that would have gone on to present with acute symptoms of an influenza-like disease. This contrasts with trial designs where triggering of treatment might be based on presence of viral shedding/symptoms or administration of the medicament to all inoculated people. Selecting appropriate subjects for the trial in advance avoids the problems associated with assessing the efficacy of the medicament in populations where the ability to detect a difference is more difficult (i.e. uninfected, asymptomatic infected or people who only have a mild infection with minimal viral loads). b. fewer people will be exposed to the medicament unnecessarily thereby: i. reducing medicament requirements and thus manufacturing and cost benefits; ii. providing a treatment regime with an improved benefit : risk profile by selecting to provide treatment only to those that will develop acute symptoms; iii. providing an improved benefit : risk profile for both the medicament and the study by requiring fewer people to be exposed to an investigational medicament.
[0007] Therefore, there remains a need for methods of predicting whether a subject will develop acute symptoms of an influenza-like disease to enable informed treatment decisions, to administer the correct level of care, and/or to improve the trial design for investigative medicaments to treat influenza- like disease.
[0008] Woods et al., “A Host Transcriptional Signature for Presymptomatic Detection of Infection in Humans Exposed to Influenza H1N1 or H3N2”, PLOS ONE, January 2013; 8(1): e52198 describe the generation of a viral gene signature (or factor) for symptomatic influenza that is capable of detecting 94% of infected cases. The gene signature is detectable as early as 29 horns post-exposure and is reported to achieve maximum accuracy on average 43 hours (p = 0.003, H1N1) and 38 hours (p-value equals 0.005, H3N2) before peak clinical symptoms.
[0009] While Woods et al. discloses methods for identifying a subject infected with a respiratory virus prior to presentation of symptoms, such methods do not predict whether or not an individual will develop acute symptoms of an influenza-like disease.
Summary of the invention
[00010] The present invention provides a biomarker for predicting whether a subject will develop acute symptoms of disease after exposure, or possible exposine, to a respiratory virus, wherein the biomarker comprises or is derived from expression levels of one or more genes selected from a gene panel comprising PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, H0MER3, NSUN6, EPHA4 and BMP2K measured in a biological sample obtained from the subject after exposure, or possible exposure, to a respiratory virus.
[00011] A biomarker in the context of the present invention is a measurable indicator of biological state or condition, in particular, an output to predict whether a subject will develop acute symptoms of disease. The output may be a numerical output. In some embodiments, the biomarker of the invention may be a composite biomarker comprising expression levels of two or more genes of the gene panel or the expression level of at least one gene of the gene panel in combination with at least one other factor as described herein.
[00012] The subject may be a human or non-human mammal.
[00013] The acute symptoms of disease may consist of symptoms of an influenza-like or other respiratory disease, as disclosed herein.
[00014] Acute symptoms of an influenza-like or other respiratory disease means that the subject experiences four or more of the following symptoms and these symptoms, either individually or combined, interfere with daily activities. The symptoms include runny nose, stuffy nose, sore throat, sneezing, earache, cough, shortness of breath, wheezing, chest tightness, headache, malaise, myalgia, muscle and/or joint aches, elevated temperature, chilliness, and feverishness. The elevated temperature may be a temperature of 38 °C or more, optionally experienced together with a cough, optionally with onset within the last 8-12 (e.g. 10) days. During an average human challenge study involving inoculation with a respiratory virus in which the symptoms of influenza-like or other respiratory disease are evaluated a subset of subjects will have acute symptoms, these subject’s symptoms will score in the 85th percentile, for example the subject’s total VAS or CAT score will be in the 85th percentile. For example, a total VAS score of greater than or equal to 25 units, or a CAT score greater than or equal to 10 units, equates to acute symptoms.
[00015] Thus in some embodiments, a subject may be identified as susceptible to progression to a complicated form of an influenza-like or other respiratory disease. A complicated respiratory disease such as influenza is defined as disease requiring hospital admission and/or with symptoms and signs of lower respiratory tract infection (hypoxaemia, dyspnoea, lung infiltrate), central nervous system involvement and/or a significant exacerbation of an underlying medical condition.
[00016] If a subject is predicted to develop acute symptoms of an influenza-like or other respiratory disease, such as complicated flu or other respiratory disease, then in accordance with the present invention, it is predicted that the subject will go on to exhibit acute symptoms as defined above. It may be predicted that a subject will develop acute symptoms of an influenza-like or other respiratory disease, but the subject may self-resolve, or action may be taken to prevent the subject developing acute symptoms; for example a medicament may be administered. Thus, a subject predicted to develop acute symptoms of an influenza-like or other respiratory disease does not inevitably develop acute symptoms of an influenza- like or other respiratory disease.
[00017] Respiratory virus includes all viral infections of the respiratory tract including respiratory syncytial virus (RSV), parainfluenza virus (HPIV), metapneumovirus (HMPV), rhinovirus (HRV), coronavirus, adenovirus (HAdV), enterovirus (EV), bocavirus (HBoV), parechovirus (HPeV), influenza including influenza A and influenza B.
[00018] A gene panel in the context of the present invention is a set of genes the expression levels of which can be analysed and used to predict the progression and/or outcome of an influenza-like or other respiratory disease in a subject. A gene sub-panel is a set of genes selected from a gene panel which may be used to predict at a certain stage in the progression of the disease, for example early, mid or late stage progression towards possibly developing acute symptoms of an influenza like or other respiratory disease, or at a certain time point following inoculation with, or exposine to, a respiratory virus, for example up to 25 hours (for example 13-25 hours), or 37-49 hours, or 49-61 hours, in some embodiments these time frames may be referred to as early, mid or late stage respectively. In some circumstances it may not be possible to determine when a subject was exposed to a respiratory virus, in addition some subjects develop symptoms quicker than other subjects, and so disease progression and stage of disease progression may be estimated based on evaluated symptoms.
[00019] In particular the gene panel of the present invention may include one or more genes (including two genes, three genes, four genes, five genes, six genes, etc.) selected from PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
[00020] In some embodiments, the gene panel may consist of up to 16 genes (typically up to 10 genes, and more typically up to 6 genes) including one or more genes (including two genes, three genes, four genes, five genes, six genes, etc.) selected from PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
[00021] In some embodiments, the gene panel may comprise PHF20. In some embodiments, the gene panel may comprise NOL9. In some embodiments, the gene panel may comprise both PHF20 and NOL9.
[00022] The gene panel of the present invention may consist of one, two, three, four, five or six genes selected from PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K. Thus, for example, the gene panel of the present invention may consist of one, two, three, four, five or six genes, including PHF20 and optionally NOL9.
[00023] Unless indicated otherwise, the present invention does not exclude the possibility of including within the gene panel one or more further genes not specifically disclosed herein, which may be found to improve further the accuracy, sensitivity or specificity of the methods of the invention. [00024] A first gene sub-panel may comprise PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2.
[00025] Thus, the first gene sub-panel may comprise the expression level of PHF20. In addition to the expression level of PHF20, the first gene sub-panel may comprise the expression level of one or both of APBA2 and ABCA1. Where the first gene sub-panel comprises the expression level of PHF20 and the expression level of one or both of APBA2 and ABCA1, the first gene sub-panel may additionally comprise the expression level of one, two or three of M0RC2, SNU13 and DCUN1D2.
[00026] The first gene sub-panel may consist of one, two, three, four, five, or six of PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2.
[00027] The first gene sub-panel may consist of one gene, which is PHF20. The first gene sub-panel may consist of two genes, one of which is PHF20. The first gene sub-panel may consist of three genes, one of which is PHF20. The first gene sub-panel may consist of four genes, one of which is PHF20. The first gene sub-panel may consist of five genes, one of which is PHF20. The first gene sub-panel may consist of six genes, one of which is PHF20.
[00028] The first gene sub-panel may consist of two genes, one of which is PHF20, and the other of which is APBA2 or ABCA1. The first gene sub-panel may consist of three genes, including PHF20, accompanied by one or both of APBA2 and ABCA1. The first gene sub-panel may consist of four genes, including PHF20, accompanied by one or both of APBA2 and ABCA1. The first gene sub-panel may consist of five genes, including PHF20, accompanied by one or both of APBA2 and ABCA1. The first gene sub-panel may consist of six genes, including PHF20, accompanied by one or both of APBA2 and ABCA1.
[00029] A second gene sub-panel may comprise MAX, NOL9, MPRIP, HP, BST1 and TM9SF2.
[00030] Thus, the second gene sub-panel may comprise the expression level of one or more of NOL9, HP and MAX (particularly NOL9). In addition to the expression level of one or more of HP, MAX and NOL9 (particularly NOL9) the second gene sub-panel may comprise the expression level of one or both of BST1 and MPRIP. Where the second gene sub-panel comprises the expression level of one or more of HP, MAX and NOL9 (particularly NOL9), and the expression level of one or both of BST1 and MPRIP, the second gene sub-panel may additionally comprise the expression level of TM9SF2.
[00031] The second gene sub-panel may consist of one, two, three, four, five, or six of MAX, NOL9, MPRIP, HP, BST1 and TM9SF2.
[00032] The second gene sub-panel may consist of one gene, which is NOL9, HP or MAX (particularly NOL9). The second gene sub-panel may consist of two genes, one or both of which are selected from NOL9, HP and MAX (particularly NOL9). The second gene sub-panel may consist of three genes, one, two or all of which are selected from NOL9, HP and MAX (particularly NOL9). The second gene sub- panel may consist of four genes, one, two or three of which are selected from NOL9, HP and MAX (particularly NOL9). The second gene sub-panel may consist of five genes, one, two or three of which are selected from NOL9, HP and MAX (particularly NOL9). The second gene sub-panel may consist of six genes, one, two or three of which are selected from NOL9, HP and MAX (particularly NOL9).
[00033] The second gene sub-panel may consist of two genes, one of which is NOL9, HP or MAX (particularly NOL9) and the other of which is BST1 or MPRIP. The second gene sub-panel may consist of three genes, including one or more of NOL9, HP or MAX (particularly NOL9), accompanied by one or both of BST1 and MPRIP. The second gene sub-panel may consist of four genes, including one or more of NOL9, HP or MAX (particularly NOL9), accompanied by one or both of BST1 and MPRIP. The second gene sub-panel may consist of five genes, including one or more of NOL9, HP or MAX (particularly NOL9), accompanied by one or both of BST1 and MPRIP. The second gene sub-panel may consist of six genes, including one or more of NOL9, HP or MAX (particularly NOL9), accompanied by one or both of BST1 and MPRIP.
[00034] A third gene sub-panel may comprise HOMER3, NSUN6, HP, EPHA4 and BMP2K.
[00035] Thus, the third gene sub-panel may comprise the expression level of one or both of HP and HOMER3. In addition to the expression level of one or both of HP and H0MER3, the third gene sub- panel may comprise the expression level of one or both of EPHA4 and BMP2K. Where the third gene sub-panel comprises the expression level of one or both of HP and H0MER3, the expression level of one or both of EPHA4 and BMP2K, the third gene sub-panel may additionally comprise the expression level ofNSUN6.
[00036] The third gene sub-panel may consist of one, two, three, four or five of H0MER3, NSUN6, HP, EPHA4 and BMP2K.
[00037] The third gene sub-panel may consist of one gene, which is HP or H0MER3. The third gene sub- panel may consist of two genes, one or both of which are selected from HP and HOMER3. The third gene sub-panel may consist of three genes, one or two of which are selected from HP and HOMER3. The third gene sub-panel may consist of four genes, one or two of which are selected from HP and H0MER3. The third gene sub-panel may consist of five genes, one or two of which are selected from HP and HOMER3.
[00038] The third gene sub-panel may consist of two genes, one of which is HP or H0MER3; and the other of which is EPHA4 or BMP2K. The third gene sub-panel may consist of three genes, including one or both of HP and H0MER3, accompanied by one or both of EPHA4 and BMP2K. The third gene sub- panel may consist of four genes, including one or both of HP and H0MER3, accompanied by one or both of EPHA4 and BMP2K. The third gene sub-panel may consist of five genes, including one or both of HP and H0MER3, accompanied by one or both of EPHA4 and BMP2K.
[00039] In some embodiments, any of the aforementioned gene panels may further comprise 1 to 2 genes in addition to those listed above, without departing from the essential character of the panels of the present disclosure.
[00040] As shown in the Examples below, the genes that have been identified as being predictive of a subject developing acute symptoms of an influenza like or other respiratory disease, in accordance with the present invention, exhibit altered expression levels following inoculation with a virus in subjects who then go on to exhibit acute symptoms relative to those who do not develop acute symptoms, as defined above. This indicates the potential of the genes to identify subjects who are more likely to develop acute symptoms of an influenza-like or other respiratory disease. Since symptoms of viral infection develop sooner in some subjects than in others, altered expression of the one or more genes according to the present invention may be predictive of acute symptoms, before a subject shows any symptoms of infection, or an early diagnostic indicator of acute symptoms at about the same time as the subject starts to show the first symptoms of infection.
[00041] The expression levels of the one or more genes in the biological sample may be measured using any suitable method known in the art for quantifying the expression level of a gene, particularly a mammahan gene. In some embodiments, the expression level of the one or more genes may be measured by quantifying mRNA transcripts of the one or more genes according to the invention in the biological sample.
[00042] Preferably, a PCR-based method may be used such, for example, as RT-qPCR. Examples of RT- qPCR-based methods are disclosed by United States patent no. 7,101,663, the contents of which are incorporated herein by reference. An advantage of real-time PCR is its relative ease and convenience of use.
[00043] Alternatively, a gene expression microarray may be used of the kind disclosed in, for example, United States patent no. 6,040,138, the contents of which are incorporated herein by reference, in which a pool of labelled target cRNA molecules, which are obtained by transcribing double-stranded cDNA derived from the mRNA transcripts that are isolated from the biological sample and fragmenting the resulting cRNA transcripts, are hybridised to oligonucleotide probes having specific sequences that are immobilised at specific addresses on a solid support. After incubating the cRNA targets with the surface- bound probes, the arrays are washed and the labels on the targets may be used to quantify how much target is bound to any given feature on the array. The amount of a given surface-bound target cRNA is proportional to the expression level of the corresponding gene.
[00044] Alternatively, RNA-seq may be used to quantify, discover and profile RNAs. This uses next- generation sequencing on cDNA converted from RNA (Wang et al 2009).
[00045] Suitably, the biological sample may be a blood or a respiratory sample. In particular, the sample may be a sample containing immune cells.
[00046] In some embodiments, the expression level of each of the one or more genes may be compared with a respective reference level. The reference level may be a threshold expression level that indicates acute symptoms of an influenza -like or other respiratory disease or a prediction of acute symptoms developing. Alternatively, the reference level may be a baseline level of expression which indicates that the subject is unlikely to develop acute symptoms of an influenza-like or other respiratory disease.
Significantly altered expression (increased expression or decreased expression) of the one or more genes relative to their respective baseline levels, for instance by at least 1. lx, preferably at least 1.5x or 2x, or 3x, or 4x, or 5x, etc. up to 100x, may be indicative of acute symptoms of an influenza like or other respiratory disease or predictive that a subject will develop acute symptoms of an influenza like or other respiratory disease.
[00047] In some embodiments, the method may involve an individual reference level for each gene. Altered expression of at least one of the genes, preferably two or more of the genes, relative to their respective reference levels may indicate acute symptoms of an influenza like or other respiratory disease or predict developing acute symptoms of an influenza like or other respiratory disease in accordance with the present invention.
[00048] In some embodiments, the reference level for the, or each, gene may be a previously measured expression level for the gene in the same subject. In particular, the reference level for the, or each, gene may comprise a baseline expression level of the gene for the subject which is measured at a time when the subject is known not to be infected with a respiratory virus such, for example, as influenza. Where previous expression levels for the one or more genes, measured on more than one previous occasion, are available for a subject, the reference level for each gene may comprise an average of multiple previous levels.
[00049] Thus, in some circumstances, a subject may be tested once to obtain baseline levels for the one or more genes, which form reference levels that may be used subsequently in case of suspected viral infection or a routine check, for comparison with contemporaneous expression levels to predict whether or not the subject is likely to develop acute symptoms of an influenza like or other respiratory disease.
[00050] Exposure to a respiratory virus, or possible exposure to a respiratory virus, includes any contact or possible contact with a respiratory virus including, exposure to a community acquired respiratory viral infection, exposure to respiratory virus at home, within a care home, hospital or military setting. Exposure also includes inoculation of subjects during a human challenge model and/or clinical trial.
[00051] As explained above, it is not always possible to ascertain when a subject has been exposed to a respiratory virus. Furthermore, different subjects exhibit symptoms at different time points, with some subjects exhibiting symptoms earlier than other subjects. Therefore the progression of an influenza-like or other respiratory disease may be measured on a relative scale and may be referred to as early, mid or late stage progression towards a possible presentation of acute symptoms of an influenza-like or other respiratory disease. In other circumstances, for example following inoculation in a human challenge model, the precise timing of the exposure to a respiratory virus is known and it is possible to measure time from exposure in hours, for example up to 25 hours (for example 13-25 hours) after exposure.
[00052] As explained in the Figures and the Examples below, a single gene panel, containing one or more genes, may be analysed using a first algorithm. Alternatively, a combination of gene panels and gene-sub panels may be analysed. The different gene panels and gene sub-panels may be analysed simultaneously, for example using the same biological sample or samples obtained within a similar, or the same, time fame. The different gene panels and gene sub-panels may be analysed sequentially with a first gene panel or gene sub-panel analysed in a sample taken at a first time point using a first algorithm, and a second gene panel or gene sub-panel analysed in a sample taken at a second time point using a second algorithm, and a third gene panel or gene sub-panel analysed in a sample taken at a third time point using a third algorithm etc.
[00053] As mentioned above, the biomarker of the invention may be based on a number of input variables or factors, including gene expression levels, for example the age of the subject, or other underlying conditions that a subject may suffer e.g. asthma, may be included in the variables used to calculate the biomarker. The biomarker may therefore be a composite biomarker. The output of the biomarker may be a numerical value. The numerical value may be determined using a threshold, reference level, or baseline level, for example a numerical value above a certain reference level predicts that a subject will develop acute symptoms of an influenza-like or other respiratory disease.
[00054] The biomarker may be computer-generated and comprises an output variable of a classification algorithm that uses as input variables the expression levels of one or more genes in the gene panel; or one or more genes in the first gene sub-panel; or one or more genes in the second gene sub-panel; or one or more genes in the third gene sub-panel.
[00055] The classification algorithm may be configured to prioritise accuracy such that the algorithm produces the greatest number of correct predictions. Thus, the classification algorithm may be configured to prioritise Negative Predictive Value (NPV), the proportion of negative test results that are true negatives, the aim being to minimise the number of subjects predicted not to develop acute symptoms of influenza-like disease who in fact go on to develop acute symptoms of an influenza-like or other respiratory disease.
[00056] The classification algorithm may be derived by machine-learning fiom a training data-set that uses as input variables expression levels of one or more genes fiom the gene panel measured fiom a biological sample obtained fiom a group of subjects at a predetermined time after exposure to the respiratory virus, wherein the group of subjects is divided into two classes according to whether or not they developed acute symptoms of an influenza-like or other respiratory disease after exposure to the respiratory virus, and wherein the classification algorithm operates on the expression levels to produce an output variable that differentiates between the classes.
[00057] Numerous classification algorithms are available to those skilled in the art for classifying subjects into two or more classes based on their symptoms scores. Similarly, numerous machine learning techniques are available for using a training dataset comprising the two or more classes and their respective expression levels for the one or more genes to derive a classification algorithm that is able to classify a new subject based on their expression levels of the one or more genes. The performance of a classification algorithm built using a machine learning process may be validated using one or more known validation methods, e.g. cross-validation, and calculating statistical parameters (e.g. accuracy, sensitivity, specificity) so that the person skilled in the art can obtain a classification algorithm that is best suited for classifying subjects based on their expression levels of the one or more genes.
[00058] The acute symptoms of an influenza-like or other respiratory disease in the subjects of the group in the training data set may be assessed by evaluating one or more symptoms of influenza-like or other respiratory disease at a series of pre-set times after exposine to the respiratory virus. The one or more symptoms are evaluated by the subjects using diary cards, optionally visual analogue scare symptom diary cards (VAS), or optionally categorical symptoms (CAT) are recorded using a modified standardized symptom score for example the modified Jackson Score. The symptoms evaluated may include runny nose, stuffy nose, sore throat, sneezing, earache, cough, shortness of breath, headache, malaise, myalgia, muscle and/or joint aches, chilliness, and feverishness.
[00059] The first class of subjects may record a total VAS of greater than or equal to 25 units and/or a total CAT score of 10 units or greater, or may show one or more of: greatest variance in total VAS or CAT up to the peak of symptoms; greatest variance in total VAS or CAT over the duration of quarantine; or steepest gradient (slope of regression line) of total VAS or CAT up to the peak of symptoms.
[00060] Typically, machine learning processes and the resulting classification algorithms may be carried out using a computer.
[00061] The gene panels and gene sub-panels may be selected by i) analysing expression levels in biological samples obtained from the group of subjects in the data training set across the whole series of pre-set times after exposure to the virus; and ii) identifying genes that show a nominal association with acute symptoms of an influenza-like or other respiratory disease, and iii) using a variable selection process to select panels of the identified genes whose expression levels at a predetermined time after exposure to the virus exhibit maximal predictive value for developing acute symptoms of an influenza- like or other respiratory disease.
[00062] The variable selection process may comprise subjecting the expression levels of the identified genes at the predetermined time after exposure to the respiratory virus to a repeated gradient boosting process and selecting a set of 1, 2, 3, 4, 5 or 6 genes that are selected most frequently by the gradient boosting process. A variable selection process is illustrated in FIG. 6 of the accompanying drawings and is performed by a gradient boosting machine (GBM ; Friedman 2001; Friedman 2002). In the context of the present invention, differential expression analysis of genes between subjects that developed acute symptoms of influenza-like or other respiratory disease and subjects that did not develop acute symptoms of influenza-like or other respiratory disease was performed by application of a cubic p-spline model. Nominal associations arising from the cubic spline analysis were input into a variable selection process comprising gradient boosting machine, and iterative searches were conducted using fifty starting point (seeds), to determine the best gene predictors of developing acute symptoms of influenza-like or other respiratory disease.
[00063] The biomarker may be used to allocate subjects to groups in a clinical trial or to make treatment decisions. Subjects allocated to one subgroup are administered a medicament while those allocated to another subgroup are not administered a medicament or do not receive a medicament until later in the trial or study. The biomarker may also be used to monitor the efficacy of a medicament by assessing whether a subject to whom the medicament has been administered is likely to develop acute symptoms of disease after exposure, or possible exposure, to a respiratory virus. The medicament may comprise a therapeutic agent or a preventative agent such, for example, as a vaccine.
[00064] Thus, the present invention provides a method of predicting whether a subject will develop acute or complicated symptoms of disease after exposure, or possible exposure, to a respiratory virus which comprises analysing a biomarker according to the invention and comparing the biomarker to a reference for the biomarker.
[00065] The present invention also provides a method of conducting a clinical trial or field study in which a group of subjects are exposed to a respiratory virus, the method comprising analysing a biomarker according to the invention for each subject and comparing the biomarker to a reference for the biomarker to predict whether the subject is likely to develop acute symptoms of disease, and including subjects who are predicted to develop acute symptoms of disease in a first subgroup of the clinical trial or field study and including subjects who are predicted not to develop acute symptoms of disease in a second subgroup.
[00066] As explained above, the methods of the invention may include comparing the biomarker to a reference for the biomarker or to a baseline for the biomarker. The baseline for the biomarker may be determined at a time when the subject is known not to be infected with a respiratory virus. The disease may be an influenza-like or other respiratory disease.
[00067] A medicament includes all substances used for medical treatment and includes vaccines, drugs, placebos and investigational medicaments, for example investigational medicaments that are the subject of a clinical trial. Therefore medicament includes licensed, unlicensed and investigational medicaments. Medicament also includes products that already have a marketing authorisation but that are being tested for a different use, or for efficacy when assembled in a different way, or tested to gain further information about the authorised use.
[00068] The invention also provides a computer program for predicting whether a subject will develop acute symptoms of disease such, for example, as an influenza-like disease after exposure, or possible exposure, to a respiratory virus, which comprises instractions which, when the program is executed by a computer, cause the computer to generate a biomarker according to the invention.
[00069] The invention further provides a classification algorithm for predicting whether a subject will develop acute symptoms of disease such for example as an influenza-like or other respiratory disease after exposure, or possible exposure, to a respiratory virus, wherein the classification algorithm is derived by analysing expression levels of one or more genes in subjects who have developed acute symptoms of disease and comparing with the expression levels in subjects who do not develop acute symptoms of disease, wherein the one or more genes are PHF20, AB CAI, APBA2, M0RC2, SNU13 DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
[00070] The invention further provides a computer readable medium and/or computer program comprising instructions which, when executed by a computer, cause the computer to carry out the classification algorithm according to the invention.
[00071] The invention also provides a computer-implemented method for predicting whether a subject will develop acute symptoms of disease such for example as an influenza-like disease, wherein a biomarker is generated by analysing expression levels of one or more genes in subjects who have developed acute symptoms of disease following inoculation with a respiratory virus and comparing with the expression levels in subjects who do not develop acute symptoms of disease following inoculation with a respiratory virus, wherein the one or more genes are PHF20, ABCA1, APBA2, MORC2, SNU13 DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
[00072] The invention also provides a computer-implemented method, wherein the method comprises a graphical user interface which displays the biomarker to the user. It is also contemplated that the computer-implemented aspects of the invention may be carried out by more than one computer e.g. two or more computer operating in different locations. Two or more computers may communication via a data channel, for example the internet.
[00073] The invention also provides a method of predicting whether a subject will develop acute symptoms of disease after exposure, or possible exposure, to a respiratory virus, which comprises estimating time elapsed after the exposure, or possible exposure, to the respiratory virus by analysing expression levels of one or more genes selected from PHF20, AB CAI, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, H0MER3, NSUN6, EPHA4, in a biological sample obtained from the subject; selecting a biomarker described herein, which at said time exhibits maximal predictive value for developing acute symptoms of disease; and comparing the biomarker to a reference for the biomarker.
[00074] For example, in some embodiments, the time elapsed may be estimated to be about a day (i.e. about 23-26 hours, e.g. 25 hours) after exposine, or possible exposure, to the respiratory virus. If so, the selected biomarker may comprise expression levels of one or more genes from the first gene sub-panel described herein.
[00075] The time elapsed may be estimated to be about 1.5-2 days (i.e. about 37-49 hours) after exposure, or possible exposure, to the respiratory virus. If so, the selected biomarker may comprise expression levels of one or more genes from the second gene sub-panel described herein.
[00076] The time elapsed may be estimated to be he about 2-2.5 days (i.e. about 49-61 hours) after exposure, or possible exposure, to the respiratory virus. If so, the selected biomarker may comprise expression levels of one or more genes from the third gene sub-panel described herein. [00077] The invention also provides a kit for use in a method according to the invention. The kit may comprise one or more reagents that allow detection, optionally quantitation, of one or more nucleotides, or one or more peptides, corresponding to one or more genes from the gene panel or the first, second or third gene sub-panel described herein. The kit may be for detection of one or more analytes in, or extracted from, a biological sample, such as, but not limited to, a blood, serum, plasma, mine, saliva, tissue biopsy, stool, sputum, skin, nose or throat sample. The kit may comprise a device for conducting an assay, such as a lateral flow assay. The device may be configured for autonomous use by a patient (without assistance from a physician), for example in the home (as opposed to a hospital or other medical facility). The device may comprise a strip of porous material, which is capable of supporting capillary flow, wherein there is a zone for receiving a sample; a zone comprising a reagent for detection of an analyte; a detection zone; and a control zone. In use, a reaction between the reagent and the analyte may be detected in the detection zone, for example by a change in colour of material in the detection zone. In use, the control zone may serve as a reference against which to benchmark the reaction detected in the detection zone. The device may comprise more than one test strip, for example the device may comprise two, three, four, five or six test strips, in communication with the same or separate receiving zones.
[00078] Literature references to and sequence listings for the above-mentioned genes are included at the end of this description. It will be understood that the reference and sequence listings necessarily disclose specific alleles and are included by way of example only. The invention is not limited to the use of such specific alleles, but may also be implemented using products of expression of different variants of the one or more genes.
Brief description of the drawings
[00079] Following is a description by way of example only with reference to the accompanying drawings of embodiments of the present invention.
[00080] In the drawings:
[00081] FIG. 1 is a chart showing how a subject is processed through a typical clinical study as described in Example 1 below.
[00082] FIG. 2 is a graph showing peak level of Total VAS for infected (left) and non-infected (right) subjects.
[00083] FIG. 3 is a graph showing change in maximum variance of VAS scores demonstrating that the four individuals with acute symptoms of influenza-like disease at peak all experienced a change in variance of Total VAS greater than 30 units. [00084] FIG. 4 is a graph showing peak level of peak categorical scores for infected (left) and non- infected (right) subjects across three studies (hVIVO-l, Duke-1, Duke-2).
[00085] FIG. 5 is a principal component analysis to show greater homogeneity after imposing adjustment for study.
[00086] FIG. 6 is a flow chart to demonstrate the variable selection process performed by gradient boosting machines.
[00087] FIG. 7 is a flow diagram to demonstrate scenarios in which 1 to 3 of the algorithms are run in parallel to assign subjects to groups for actioning (e.g. dosing with a medicament, additional clinical assessments). The scenarios include: within the human viral challenge model, field study, or in the community, wherein subjects are exposed to a respiratory virus.
[00088] FIG. 8 is a flow diagram to demonstrate scenarios in which each gene algorithm is used sequentially to assign subjects to groups for actioning (e.g. dosing with a medicament, additional clinical assessments) within the human viral challenge model.
Examples
Example 1
[00089] As described below, subjects from three separate studies were determined to include a subset of subjects that exhibited acute symptoms of an influenza-like disease based on their peak symptom score across the quarantine.
Methods
[00090] Afiymetrix™ HG-U133 Plus 2.0 microarray across the whole quarantine post inoculation were used to perform transcriptomics analysis. Differential expression analysis between subjects developing acute symptoms of an influenza-like disease and subjects that did not develop acute symptoms of an influenza-like disease was performed by application of a cubic p-spline model. Nominal associations arising from the cubic spline analysis were input into a variable selection process to determine the best six gene predictors of acute symptoms of influenza-like disease at Day 1 morning, Day 2 morning, and Day 2 evening after inoculation. These genes could be used to distinguish between subjects that developed acute symptoms of an influenza-like disease and subjects that did not develop acute symptoms of an influenza- like disease within the model at different times after exposine to virus.
[00091] Data from three separate studies was combined for this analysis, the largest of which was run by hVIVO (hVIVO-l) and two publicly available studies also run in the hVIVO challenge model with Duke University Medical Centre (Duke-1 (Zaas et al 2009) and Duke-2 (Woods et al 2013): indexed in GEO as GSE52428 and publicly available):
Table 1 - For each study the following are demonstrated; virus used, the number of subjects, the microarray platform used, the time points at which PaxGene blood samples were taken, the methods used in diary cards to measure symptoms (VAS = Visual analogue scale, CAT = modified Jackson score/categorical), the number of diary cards taken a day and the method used to confirm influenza infection.
[00092] For hVIVO-1, 60 healthy volunteers were inoculated intranasally with influenza A H3N2 Perth/16/2009, 27 of which were used for this analysis. For Duke-1, 17 healthy volunteers were inoculated intranasally with influenza A H3N2 A/Wisconsin/67/2005. For Duke-2, 24 healthy volunteers were inoculated intranasally with Influenza A H1N1 A/Brisbane/59/2007. All volunteers provided informed consent and underwent extensive pre-enrolment health screening (FIG. 1) and any with significant baseline antibodies to the strain of influenza utilised were excluded. After approximately 48 hours in quarantine (approximately mid-day on study day 0), a predetermined dose of influenza A was instilled into bilateral nares of subjects using standard pipetting methods. The volunteers had clinical measurements and samples taken until discharged from quarantine and then at each follow up visit.
[00093] In hVIVO-1, 33 of the 60 subjects became infected after inoculation (evidenced by confirmed viral shedding), 25 were identified as not infected and 2 inconclusive. An interim analysis was performed after the first 27 were inoculated and all samples for each subject were sent for gene microarray assays. One of the 27 subjects did not complete the quarantine and so was excluded from analysis. Of the 26 subjects with viable microarray data, 13 were identified as confirmed as infected and 11 as not infected, 2 were inconclusive.
[00094] In Duke-1, 9 of the 17 subjects became infected after inoculation (evidenced by confirmed viral shedding) and 8 were identified as not infected. Four dilutions used (6.41 TCID50/mL, 5.25 TCID50/mL, 4.41 TCID50/mL and 3.08 TCID50/mL) with four to five subjects receiving each dose.
[00095] In Duke-2, 9 of the 24 subjects became infected after inoculation (evidenced by confirmed viral shedding) and 15 were identified as not infected. Four dilutions used (2.35 TCID50/mL, 1.8 TCID50/mL, 1.25 TCID50/mL and 1.4 TCID50/mL) with four to six subjects receiving each dose. One subject was excluded due to a secondary infection.
[00096] Subjects had influenza infection confirmed based on qualitative viral culture and quantitative influenza RT-PCR data from epithelial lining fluid for the hVIVO study. Epithelial lining fluid was collected from nasopharyngeal FLOQ swabs twice daily (starting on Day 1 morning, first sample approximately 20 hrs post inoculation). Nasal collection continued throughout the duration of the quarantine. For the Duke studies infection status of the subjects were obtained from the Woods et al, 2013.
[00097] Subjects self-assessed their symptoms three times daily throughout quarantine on both categorical and continuous (Visual Analogue Scale, VAS) symptom diary cards. Categorical symptoms were recorded using a modified standardized symptom score. The modified Jackson Score requires subjects to rank 10 symptoms consisting of: upper respiratory tract symptoms (runny nose, stuffy nose, sore throat, sneezing, and earache), lower respiratory symptoms (cough and shortness of breath) and systemic symptoms (headache, myalgia, and muscle and/or joint aches) on a scale of 0-3 of “no symptoms”, “just noticeable”, “bothersome but can still do activities” and “bothersome and cannot do daily activities”. hVIVO-1 included wheeze and chilliness/feverishness in addition to the 10 symptoms. Additionally, shortness of breath at rest and wheeze at rest were also recorded using an additional grade for these symptoms only (grade 4 = symptoms at rest). VAS symptoms were measure along a 10cm line, measurements were made in mm.
[00098] To determine which subjects were considered to have significant symptoms initially subjects were identified in hVIVO-1 using the VAS data which was recorded alongside the categorical score. Total VAS score was calculated for each time-point dining quarantine for all fifty-eight evaluable participants from hVIVO-1. Peak VAS score was determined for each participant, and it was observed that the range in non-infected individuals was 0-20 units. Amongst the infected set however, four individuals experienced Total VAS > 25 units (FIG. 2). The four individuals were distinguishable from other participants in additional ways; Greatest variance in Total VAS up to the peak of symptoms , Greatest variance in Total VAS over the duration of quarantine and Steepest gradient (slope of regression line) of Total VAS up to the peak of symptoms
[00099] In order to address the question of whether the four could have been identified early in quarantine, piece-wise estimates of variance and change in variance were calculated for the period of quarantine.
[000100] It was observed that the four individuals with most severe symptoms at peak all experienced a change in variance of Total VAS greater than 30 units (FIG. 3). Remaining individuals had maximum change in variance less than 25 units. Furthermore, the change was observed consistently at Day 2, time-point 2, which in three of the four instances, was in advance of symptom peak (Table 2).
Table 2
[000101] Upon the inclusion of Duke- 1 and Duke-2 to the analysis the severity score had to be adapted for use with the categorical symptom score due to not having VAS available in Duke-1 and Duke-2. A similar analysis as was conducted for VAS (above) was now performed for categorical symptom score, for the three studies. It was observed that across the three studies, 11 individuals had peak total categorical score greater than or equal to 10 units (10 subjects with microarray data). These included the four members of hVIVO-1 already discussed. Four individuals from the H1N1 study, and three individuals from the H3N2 study also passed this threshold (FIG. 4). As with VAS, sudden changes in the variance of categorical score was associated with acute symptoms of an influenza-like disease.
[000102] At predetermined intervals, blood was collected into RNA PAXGene™ collection tubes. This occurred once on Day -1, in the morning on the day of inoculation (approximately 5 hours before inoculation) followed by every 12 hours for hVIVO-1 and Duke-2 and every 8 horns for Duke-1 for the remainder of the quarantine.
[000103] All three studies used GeneChip® Human Genome U133A 2.0 Array (Affymetrix, Santa Clara, CA) for the microarray. Microarray data for both Duke studies was obtained from the Liu et al 2016 covering both studies. The public data comprised 22,277 probe-sets, a subset of the 54,675 probe- sets available for hVIVO-1.
[000104] Principal Components Analysis (PCA) showed that the three transcriptomics data sets differed systematically (FIG. 5). It was concluded that direct pooling would lead to spurious results. An adjustment for study was therefore applied to the transcriptomics measurements and the PCA repeated (FIG. 5).
[000105] In order to exclude non-informative probe-sets, two groups namely, severe and non- severe were considered. For each molecule in each group, filter ratios were calculated reflecting variability over time, and variability across individuals. Under recommended thresholds, 13,806 probe- sets were informative in at least one group, and were explored further.
[000106] Differential expression analysis between subjects that developed acute symptoms of an influenza-like disease (n=10) and those that did not develop acute symptoms of an influenza-like disease (n=56) was performed by application of a cubic p-spline model (Straube et al). A test was applied for group x time interaction, and a total of 1052 transcripts had q<0.05 after adjustment for False Discovery Rate (FDR) (Benjamini et al 1995).
[000107] To develop a molecular signature nominal associations arising fiom cubic p-spline analysis were input into a variable selection process to determine the best predictors of developing acute symptoms of an influenza-like disease at three time-points post inoculation; Day 1, Day 2 morning, and Day 2 evening (Approximately 13 to 25, 37 to 49 and 49 to 61 hours post inoculation respectively).
[000108] Variable selection was performed by gradient boosting machines (GBM ; Friedman 2001 ;
Friedman 2002), and the number of molecules to be selected was limited to six for best results. FIG. 6 shows the process that was followed.
[000109] Logistic regression was applied to produce prediction models (signatures or gene panels or gene sub-panels) for the variables selected. Signature performance (sensitivity, specificity, positive and negative predictive value) was determined at the time-point on which the model was based, and at all other time-points considered. Sensitivity - the proportion of cases that receive a positive test result. Specificity - the proportion of cases that receive a negative test result. Positive predictive value (PPV) - the proportion of positive test results that are true positives. Negative predictive value (NPV) - the proportion of negative test results that are true negatives. The AUC (area under the receiver operating characteristic (ROC) curve) was determined.
Day 1 (AM) Signature
[000110] Table 3 shows the variables selected by gradient boosting, Table 4 shows the signature arising from logistic regression and lastly, Table 5 shows the test performance characteristics at all time- points considered. It can be seen that the signature, or gene sub-panel, performs well (AUG > 0.80) on Day 1 data.
[000111] This signature, or gene sub-panel, includes the genes PHF20, ABCA1, APBA2, M0RC2, SNU13, and DCUN1D2.
Table 3: Variables selected at Dav 1 (AM)
Table 4: Dav 1 (AM) signature
Table 5: Test performance of Dav 1 (AM) signature
Day 2 (AM) Signature
[000112] Table 6 shows the variables selected by gradient boosting, Table 7 shows the signature arising from logistic regression and lastly, Table 8 shows the test performance characteristics at all time- points considered. The signature performs well at Day 2 (AM). [000113] This signature, or gene sub-panel, includes the genes MAX, N0L9, MPRIP, HP, BST1,
TM9SF2.
Table 6: Variables selected at Dav 2 (AM)
Table 7: Dav 2 (AM) signature
Table 8: Test performance of Dav 2 (AM) signature
Day 2 (PM) Signature
[000114] Table 9 shows the variables selected by gradient boosting, Table 10 shows the signature arising from logistic regression and lastly, Table 11 shows the test performance characteristics at all time- points considered.
[000115] This signature, or gene sub-panel, includes the genes H0MER3, NSUN6, HP, EPHA4,
BMP2K. Table 9: Variables selected at Dav 2 (PM)
Table 10: Dav 2 (PM) signature
Table 11: Test performance of Dav 2 (PM) signature
[000116] Once the gene sub-panels were identified, it was possible to determine the minimum number of genes required to provide a good prediction of whether a subject would go on to develop acute symptoms of an influenza-like disease. An AUG value of 0.8 or higher provides a good prediction that a subject will develop acute symptoms of an influenza-like disease. The parameter with lowest relative influence was dropped from consideration and an updated signature was derived. Model performance (sensitivity, specificity, PPV and NPV) was derived for the updated model. A ROC curve was drawn and AUC was tabulated. Further model parameters were sequentially dropped in order of increasing relative influence. In this way, model performance was determined for signatures based upon 5 ,4, 3, 2 and 1 genes at different time points (Tables 12 to 14). It can be seen that any of 1, 2, 3, 4, 5, or 6 genes can be predictive of developing acute symptoms of an influenza-like disease.
Table 12 - Dav lam algorithm performance AUC values
Table 13 - Dav 2am algorithm performance AUC values
Table 14 - Dav 2pm algorithm performance AUC values Example 2
[000117] A group of individuals are recruited into a human viral challenge model and inoculated with a respiratory virus. It is beneficial to identify, in advance, subjects who will progress to develop acute symptoms of an influenza like disease, allowing selective dosing of these subjects with an investigational or licensed medicament (drug/vaccine/ placebo) at the earliest opportunity. This will improve the ability to detect a clinically relevant reduction in disease in response to the medicament by only evaluating the medicament effects in individuals that will/would have developed acute symptoms of an influenza-like disease. This will also reduce unnecessary exposure of subjects to an investigational medicament. This will also reduce the amount of medicament required.
[000118] Volunteers are screened for eligibility for the evaluation of efficacy of an investigational medicament in a human challenge study with a respiratory virus, in particular with influenza.
[000119] Eligible volunteers arrive at the clinic and baseline samples and clinical measures are taken, before they are exposed to virus (e.g. inoculation). Baseline values are obtained pre-inoculation using one or more blood samples over varying time-points.
[000120] Blood samples are taken regularly before and after virus exposure (e.g. paxgene RNA samples twice, three times a day, or more) alongside clinical measures of their disease.
[000121] Expression levels of specific gene panels and sub-panels are measured in real-time from the blood paxgenes. As in Example 1, blood is assessed for gene expression utilising Affymetrix HG- U133 Plus 2.0 microarray chips, which were used to measure the transcripts’ expression. Microarray data was pre-processed using RMA background correction and quantiles normalization. One can use the absolute value of each gene at a given time-point or alternatively where a baseline gene level was obtained and available, the gene levels post exposine to virus can be baseline normalised for each subject (i.e. compared to the subject’s baseline expression level for that gene or gene panel or gene sub-panel).
[000122] Three separate gene subpanels can be used (i.e. 3 algorithms) to identify which individuals will develop acute symptoms of an influenza-like disease a. The 3 gene sub panels are: i. Subpanel A: PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2 ii. Subpanel B: MAX, NOL9, MPRIP, HP, BST1 and TM9SF2 iii. Subpanel C: H0MER3, NSUN6, HP, EPHA4 and BMP2K
[000123] In one instance the different gene subpanels are used at different time points (FIG. 8, sub panel A, followed by sub panel B, followed by subpanel C). Alternatively, instead of using the subpanels sequentially 1, 2, or 3 gene sub panels are used at the same time (FIG. 7), and may be repeated at several points following inoculation. a. For both approaches, a positive result for any test immediately triggers dosing of the subject with the investigational medicament (drug/vaccine/placebo). b. Alternatively, two positive results, or three positive results, are required to trigger dosing.
[000124] For each gene sub-panel the stringency can be varied by modifying the threshold at which a positive result is obtained. For example, the threshold for gene sub-panel 1 could be set to be more stringent avoiding false positives. Gene sub-panel 2 is then set with lower stringency and gene sub- panel 3 with even lower stringency, thus increasing the chances of identifying and dosing all subjects who will develop acute symptoms of influenza-like disease as early as possible.
[000125] In addition to the use of the gene sub-panels, the results may be combined with a diagnostic test that confirms the subject has the respiratory viral infection relevant to the trial (e.g. a viral test).
[000126] In addition to the use of the gene sub-panels, the results may be combined with measurements of the change in variance/gradient of symptoms.
[000127] Other actions that can be triggered alongside dosing with a medicament include increasing the observations/samples/measurements in those predicted to develop acute symptoms of an influenza-like disease or reducing observations/samples/measurements in those who are predicted not to develop acute symptoms of an influenza-like disease.
[000128] By using this invention as part of the trial design decision making, only those subjects most likely to develop acute symptoms of influenza like disease are included in the statistical analysis of efficacy for the investigational medicament, which has the benefits as previously described.
[000129] In some instances, where the algorithms do not report a positive test result, the subjects may be dosed at a predetermined time point post exposine or inoculation (e.g. Day 4). These subjects form a further subgroup for analysis.
Example 3
[000130] A group of individuals are recruited into an efficacy field study and become infected in the community with a respiratory virus, in particular influenza. Following exposure, it would be beneficial to identify, in advance, subjects who will develop acute symptoms of an influenza like disease, allowing selective dosing of these subjects with an investigational or licensed medicament (drug/vaccine/ placebo) at the earliest opportunity. This would improve ability to detect clinically relevant reduction in disease by only evaluating the medicament effects in individuals that would have gone on to present with acute symptoms of an influenza-like disease. This will reduce unnecessary exposine of subjects to an investigational medicament as well as reducing the amount of medicament required.
[000131] In this example, volunteers have been screened for eligibility for the evaluation of the efficacy of an investigational medicament in a clinical field trial against a respiratory virus, in particular influenza.
[000132] Eligible volunteers arrive at the clinic and baseline samples and clinical measures are taken as they are enrolled in the study. Baseline values would be obtained using one or more blood samples over varying time-points post enrolment and prior to contracting a respiratory virus infection in the community.
[000133] Blood samples (e.g. paxgene RNA samples twice, three times a day, or more) are taken after virus exposure from a household contact that has a respiratory infection, or after showing initial symptoms of respiratory disease (the trial subjects may or may not be using a study questionnaire/symptom diary card that captures these symptoms).
[000134] Specific gene sub-panels are measured real-time in the blood paxgenes using the methods described in Examples 1 and 2. One can use the absolute value of each gene at a given time-point or alternatively where a baseline gene level was obtained and available, the gene levels post exposure to virus can be baseline normalised for each subject (i.e. compared to baseline).
[000135] Three separate gene sub-panels can be used (i.e. 3 algorithms) to identify which individuals will progress to have acute symptoms of an influenza-like disease a. The 3 subpanels of genes are: i. Subpanel A: PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2 ii. Subpanel B: MAX, NOL9, MPRIP, HP, BST1 and TM9SF2 iii. Subpanel C: HOMER3, NSUN6, HP, EPHA4 and BMP2K
[000136] In one instance 1, 2, or 3 gene sub-panels are used at the same (FIG. 7), and may be repeated at several points over time. a. a positive result for any test immediately triggers dosing of the subject with the investigational medicament (drug/vaccine/placebo). a. Alternatively, two positive results or three positive results are required to trigger dosing.
[000137] For each gene sub-panel the stringency can vary by modifying the threshold at which a positive result is obtained. For example, the threshold for gene sub-panel 1 could be set to be more stringent avoiding false positives. Gene sub-panel 2 is then set with lower stringency and gene sub-panel 3 with even lower stringency, thus increasing the chances of identifying and dosing all subjects who will develop acute symptoms of an influenza-like disease as early as possible.
[000138] In addition to the use of the gene sub-panels, the results may be combined with a diagnostic test that confirms the subject has the respiratory viral infection relevant to the trial (e.g. viral test).
[000139] In addition to the use of the gene sub-panels, where a symptom diary card is used the results may be combined with measurements of the change in variance/gradient of symptoms.
[000140] Other actions that can be triggered alongside dosing with a medicament include increasing the observations/samples/measurements in those predicted to go on develop acute symptoms of influenza-like disease or reducing observations/samples/measurements in those predicted to not develop acute symptoms of an influenza-like disease.
[000141] By using this invention as part of the trial design decision making, only those most likely to develop acute symptoms of influenza-like disease are included in the statistical analysis of efficacy for the investigational medicament, which has the benefits as previously described.
[000142] In some instances, where the algorithms do not report a positive, the subjects may be dosed at a predetermined time points post exposine (e.g. Day 4, Day 5), with these subjects forming a secondary subgroup analysis.
Example 4
[000143] Subjects may become infected with a respiratory virus in the community or exposed to an infected person for an extended period in the community. Community settings can include at home (family member, household contact), at work, in transit (e.g. on a train, coach, plane, ship), within a care home (fellow resident, family visitor, carer), as an inpatient in hospital (fellow inpatient, healthcare worker, visitor), within military setting (fellow personnel). Following exposure, it would be beneficial to identify, in advance, people who will progress to have acute symptoms of an influenza-like disease, allowing an intervention at the earliest opportunity. Interventions include assisting referral to healthcare professionals, enabling earlier treatment with an antiviral than would otherwise be possible (for example Tamiflu), administration of an immunomodulator drug or combination of antiviral and immunomodulator, separation fiom others (quarantine), inclusion in a study (IMP trial, transmission study), initiate sampling for disease/biomarker monitoring.
[000144] A subject becomes infected with a respiratory virus in the community or is exposed to an infected person for a prolonged period in the community.
[000145] Post viral exposure blood samples may be collected and gene levels quantified following one or more triggers: a. a positive diagnostic test (e.g. based on viral replication, diagnostic biomarker) b. initial symptoms of respiratory viral disease c. prolonged exposine to an infected contact
[000146] Specific gene panels or gene sub-panels are measured real-time in the blood sample (e.g. using a Point of Care test). The absolute value of each gene at a given time-point can be used or alternatively where a baseline gene level was obtained or is available, the gene levels post exposure to virus can be baseline normalised for each subject (i.e. compared to baseline).
[000147] In one scenario, three separate gene sub-panels are used (i.e. three algorithms) to identify which individuals are likely to develop acute symptoms of an influenza-like disease a. The three subpanels of genes are: i. Subpanel A: PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2 ii. Subpanel B: MAX, NOL9, MPRIP, HP, BST1 and TM9SF2 iii. Subpanel C: HOMER3, NSUN6, HP, EPHA4 and BMP2K
[000148] In another scenario one, two, or three gene sub-panels are used at the same (FIG. 7), and may be repeated at several time points. a. a positive result for any test may trigger one or more of the following: i. assisting referral to healthcare professionals, ii. enabling earlier treatment with an antiviral than would otherwise be possible (e.g.Tamiflu), iii. administration of an immunomodulator drug or combination of antiviral and immunomodulator, iv. separation from others (quarantine, using barriers to transmission e.g. masks), v. inclusion in a study (e.g. IMP trial, disease study, transmission study), vi. initiate sampling for disease/biomarker monitoring b. Alternatively, two positive results or three positive results are required to trigger the actions, as listed above.
[000149] For each gene sub-panel the stringency can vary by modifying the threshold at which a positive result is obtained. For example, the threshold for gene sub-panel 1 could be set to be more stringent avoiding false positives. Gene sub-panel 2 is then set with lower stringency and gene sub- panel 3 with even lower stringency, thus increasing the chances of identifying and intervening as early as possible.
[000150] In addition to the use of the gene sub-panels, the results may be combined with a diagnostic test that confirms the subject has the respiratory viral infection (if not already performed).
Gene sequences
[000151] Included below for each gene identified are the probe name specific to the GeneChip® Human Genome U133A 2.0 Array (Afiymetrix, Santa Clara, CA), the gene ID and the full sequence of the entire gene. It will be understood that the present invention is not limited to use of the GeneChip® Human Genome U133A 2.0 Array; the expression levels of the one or more genes disclosed herein may be measured using any suitable method known to those skilled in the art, employing any suitable probe or probes which are capable of binding uniquely to respective nucleic acid sequences that represent the one or more genes.
References
Benjamini, Yoav; Hochberg, Yosef (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society, Series B. 57 (1): 289-300
Cox NJ, Subbarao K. Influenza. Lancet 1999;354:1277-82
De Vincenzo et al 2014, Oral GS-5806 activity in a respiratory syncytial virus challenge study. N Engl J Med. 2014 Aug 21;371(8):711-22.
De Vincenzo et al 2015, Activity of Oral ALS-008176 in a Respiratory Syncytial Virus Challenge Study. N Engl J Med. 2015 Nov 19;373(21):2048-58.
Friedman JH (2001). Greedy function approximation: a gradient boosting machine. Arm Statist 29(5): 1189-1232.
Friedman JH (2002). Stochastic gradient boosting. Comput Stat Data Anal 38(4): 367-378.
Hodinka, “Respiratory RNA Viruses”, Microbiol Spectr., 2016 Aug; 4(4)
Lau, L. L. H , Cowling, B. J., Fang, V. J, Chan, K.-H., Lau, E. H. Y., Lipsitch, M. ... Leung, G.
M. (2010). Viral shedding and clinical illness in naturally acquired influenza virus infections. The Journal of Infectious Diseases, 201( 10), 1509- 1516
Liu, T.Y., et al., An individualized predictor of health and disease using paired reference and target samples. BMC Bioinformatics, 2016. 17: p. 47 Molinari, N. M., et al, (2007), The annual impact of seasonal influenza in the US: Measuring disease burden and costs, Volume 25. Issue 27. 28 June 2007, Pages 5086-5096
Rolfes MA, Foppa IM, Garg S, Flannery B, Brammer L, Singleton JA, et al. Estimated Influenza Illnesses, Medical Visits, Hospitalizations, and Deaths Averted by Vaccination in the United States. 2016 Dec 9
Straube J, Gorse A-D, PROOF Centre of Excellence Team et al (2015). A linear mixed model spline framework for analysing time course ‘omics data. PLoS ONE 10(8): eO 134540.
Wang, Z., Gerstein, M., & Snyder, M. (2009). RNA-Seq: a revolutionary tool for transcriptomics. Nature Reviews. Genetics, 10(1), 57-63.
Woods CW, McClain MT, Chen M, Zaas AK et al (2013). A host transcriptional signature for presymptomatic detection of infection in humans exposed to influenza H1N1 or H3N2. PLoS ONE 8(1): e52198.
Zaas, A.K., et al., Gene expression signatures diagnose influenza and other symptomatic respiratory viral infections in humans. Cell Host Microbe, 2009. 6(3): p. 207-17.

Claims

1. A method of predicting whether a subject will develop acute symptoms of disease after exposure, or possible exposure, to a respiratory virus, which comprises analysing a biological sample obtained from the subject for a biomarker and comparing the biomarker to a reference for the biomarker, wherein the biomarker comprises or is derived from expression levels of one or more genes selected from a gene panel comprising PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, H0MER3, NSUN6, EPHA4 and BMP2K.
2. A method according to claim 1, wherein the gene panel consists of one, two, three, four, five, or six genes selected from PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, H0MER3, NSUN6, EPHA4 and BMP2K.
3. A method according to claim 1 or claim 2, wherein the biomarker comprises expression levels of one or more genes selected from a first gene sub-panel comprising PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2.
4. A method according to claim 3, wherein the first gene sub-panel comprises the expression level of PHF20.
5. A method according to claim 4, wherein the first gene sub-panel further comprises the expression level of one or both of APBA2 and ABCA1.
6. A method according to claim 5, wherein the first gene sub-panel further comprises the expression level of one, two or three of M0RC2, SNU13 and DCUN1D2.
4. A method according to any one of claims 3 to 6, wherein the first gene sub-panel consists of one, two, three, four, five, or six of PHF20, ABCA1, APBA2, M0RC2, SNU13 and DCUN1D2.
8. A method according to claim 1 or claim 2, wherein the biomarker comprises expression levels of one or more genes selected from a second gene sub-panel comprising MAX, NOL9, MPRIP, HP, BST1 and TM9SF2.
9. A method according to claim 8, wherein the second gene sub-panel comprises the expression level of one or more of NOL9, HP and MAX.
10. A method according to claim 9, wherein the second gene sub-panel further comprises the expression level of one or both of BST1 and MPRIP.
11. A method according to claim 10, wherein the second gene sub-panel further comprises the expression level of TM9SF2.
12. A method according to any one of claims 8 to 11, wherein the second gene sub-panel consists of one, two, three, four, five, or six of MAX, NOL9, MPRIP, HP, BST1 and TM9SF2.
13. A method according to claim 1 or claim 2, wherein the biomarker comprises expression levels of one or more genes selected from a third gene sub-panel comprising H0MER3, NSUN6, HP, EPHA4 and BMP2K.
14. A method according to claim 13, wherein the third gene sub-panel comprises the expression level of one or both of HP and H0MER3.
15. A method according to claim 14, wherein the third gene sub-panel further comprises the expression level of one or both of EPHA4 and BMP2K.
16. A method according to claim 15, wherein the third gene sub-panel further comprises the expression level of NSUN6.
17. A method according to any one of claims 13 to 16, wherein the third gene sub-panel consists of one, two, three, four or five of H0MER3, NSUN6, HP, EPHA4 and BMP2K.
18. A method according to any one of claims 3 to 17, wherein the biomarker is associated with the relative time course progression towards developing acute symptoms of disease, such that the first gene sub-panel is associated with an early stage dining progression towards developing acute symptoms of disease, the second gene sub-panel is associated with a middle stage during progression towards developing acute symptoms of disease, and the third gene sub-panel is associated with a later stage during progression towards developing acute symptoms of disease.
19. A method according to any one of the preceding claims, wherein the biological sample is obtained from the subject up to about 25 hours after exposure, or possible exposure, to the respiratory virus.
20. A method according to claim 19, wherein the biomarker comprises expression levels of one, two, three, four, five, six or more genes selected from the first gene sub-panel defined in any one of claims 3 to 7.
21. A method according to any one of claims 1 to 18, wherein the biological sample is obtained from the subject about 37-49 hours after exposure, or possible exposure, to the respiratory virus.
22. A method according to claim 21, wherein the biomarker comprises expression levels of one, two, three, four, five, six or more genes selected from the second gene sub-panel defined in any one of claims 8 to 12.
23. A method according to any one of claims 1 to 18, wherein the biological sample is obtained from the subject about 49-61 horns after exposure, or possible exposure, to the respiratory virus.
24. A method according to claim 23, wherein the biomarker comprises expression levels of one, two, three, four, five or more genes selected from the third gene sub-panel defined in any one of claims 13 to 17.
25. A method according to any one of the preceding claims, wherein the biomarker is computer- generated and comprises an output variable of a classification algorithm that uses as input variables the expression levels of one or more genes in the gene panel; or one or more genes in the first gene sub-panel; or one or more genes in the second gene sub-panel; or one or more genes in the third gene sub-panel.
26. A method according to claim 25, wherein the output variable includes a numerical value.
27. A method according to claim 25 or claim 26, wherein the classification algorithm is derived by machine-learning from a training data-set that uses as input variables expression levels of one or more genes from the gene panel measured from a biological sample obtained from a group of subjects at a predetermined time after exposure to the respiratory virus, wherein the group of subjects is divided into two classes according to whether or not they developed acute symptoms of disease after exposure to the respiratory virus, and wherein the classification algorithm operates on the expression levels to produce an output variable that differentiates between the classes.
28. A method according to claim 27, wherein the classification algorithm comprises a generalised regression-based algorithm or decision tree.
29. A method according to claim 28, wherein the classification algorithm is configured to prioritise accuracy.
30. A method according to claim 28, wherein the classification algorithm is configured to prioritise negative predictive value (NPV).
31. A method according to any one of claims 27 to 30, wherein the acute symptoms of disease in the subjects of the group in the training data set is assessed by evaluating one or more symptoms of disease at a series of pre-set times after exposure to the respiratory virus.
32. A method according to claim 31, wherein the one or more symptoms are evaluated by the subjects using diary cards, optionally visual analogue score symptom diary cards (VAS), or optionally categorical symptoms (CAT) are recorded using a modified standardized symptom score for example the modified Jackson Score.
33. A method according to claim 31 or claim 32, wherein the two classes of subjects in the training data set are differentiated by one or more parameters based on the evaluation of the one or more symptoms including runny nose, stuffy nose, sore throat, sneezing, earache, cough, shortness of breath, headache, malaise, myalgia, muscle and/or joint aches, chilliness, and feverishness.
34. A method according to claim 32 or claim 33, wherein the first class contains subjects which record a total VAS of greater than or equal to 25 units and/or a total CAT score of 10 units or greater.
35. A method according to any one of claims 32 to 34, wherein the first class contains subjects that show one or more of: greatest variance in total VAS or CAT up to the peak of symptoms; greatest variance in total VAS or CAT over the duration of quarantine; or steepest gradient (slope of regression line) of total VAS or CAT up to the peak of symptoms.
36. A method according to any one of claims 27 to 35, wherein the gene panels and gene sub-panels are selected by: i) analysing expression levels in biological samples obtained from the group of subjects in the data training set across the whole series of pre-set times after exposure to the virus; and ii) identifying genes that show a nominal association with acute symptoms of disease, and iii) using a variable selection process to select panels of the identified genes whose expression levels at a predetermined time after exposure to the virus exhibit maximal predictive value for developing acute symptoms of disease.
37. A method according to claim 36, wherein the variable selection process comprises subjecting the expression levels of the identified genes at the predetermined time after exposine to the respiratory virus to a repeated gradient boosting process and selecting a set of 1, 2, 3, 4, 5 or 6 genes that are selected most frequently by the gradient boosting process.
38. A method according to any one of the preceding claims, wherein the biomarker is compared to a baseline for the biomarker, wherein the baseline for the biomarker is determined prior to exposure, or possible exposure, of the subject to the respiratory virus.
39. A method according to any one of the preceding claims, wherein the subject has been administered a medicinal product before or after exposine or possible exposure to the respiratory virus.
40. A method according to any one of the preceding claims, wherein the subject has had a positive diagnostic test for respiratory viral disease, presents with symptoms of respiratory viral disease, and/or has had prolonged exposure to at least one other person who is infected with a respiratory virus.
41. A method according to any one of the preceding claims wherein the subject is tested two or more times for the same or a different biomarker as defined in any one of claims 1 to 31 and the subject is indicated as predicted to develop acute symptoms if the result of at least one or more than one of the tests are positive.
42. A method according to claim 41, wherein the thresholds at which a positive result is obtained are different for the two or more tests, the threshold for at least one test being configured to minimise false positives, and the threshold for at least another test being configured to have fewer false negatives than the one test.
43. A method according to any one of the preceding claims, further comprising administering a therapeutic or prophylactic treatment to the subject if they are predicted to develop acute symptoms.
44. A method according to claim 43, wherein the treatment comprises administration of an antiviral or immunomodulatory agent.
45. A method of predicting whether a subject will develop acute symptoms of disease after exposure, or possible exposure, to a respiratory virus, which comprises estimating time elapsed after the exposure, or possible exposure, to the respiratory virus by analysing expression levels of one or more genes selected from PHF20, ABCA1, APBA2, M0RC2, SNU13, DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, H0MER3, NSUN6, EPHA4, in a biological sample obtained from the subject; selecting a biomarker as defined in any one of claims 1 to 37, which at said time exhibits maximal predictive value for developing acute symptoms of disease; and comparing the biomarker to a reference for the biomarker.
46. A method of conducting a clinical trial or field study in which a group of subjects are exposed to a respiratory virus, the method comprising analysing a biomarker as defined in any one of claims 1 to 237 for each subject and comparing the biomarker to a reference for the biomarker to predict whether the subject is likely to develop acute symptoms of disease, and including subjects who are predicted to develop acute symptoms of disease in a first subgroup of the clinical trial or field study.
47. A method according to claim 46, wherein the biomarker is compared to a baseline for the biomarker, wherein the baseline for the biomarker is determined prior to exposure, or possible exposine, of the subjects to the respiratory virus.
48. A method according to claim 46 or claim 47, wherein subjects in the first subgroup are administered a medicament after being predicted to develop acute symptoms of disease.
49. A method according to any one of claims 46 to 48, further comprising including subjects who are predicted not to develop acute symptoms of influenza-like disease in a second subgroup.
50. A method according to claim 49, wherein subjects in the second subgroup are not administered a medicament during the trial or study, or are administered a medicament at a predetermined time after commencing the trial or study.
51. A method according to any one of the preceding claims, wherein the respiratory virus is respiratory syncytial virus (RSV), parainfluenza virus (HPIV), metapneumovirus (HMPV), rhinovirus (HRV), coronavirus, adenovirus (HAdV), enterovirus (EV), bocavirus (HBoV), parechovirus (HPeV) or an influenza virus.
52. A method according to any one of the preceding claims, wherein the biological sample is a blood or respiratory sample.
53. A method according to any one of the preceding claims, wherein the expression level of the one or more genes is measured by quantifying mRNA transcripts of the one or more genes in the biological sample.
54. A method according to any one of the preceding claims, wherein the mRNA transcripts in a biological sample are quantified by one or more of; PCR based methods such as RT-qPCR; or gene expression microarray; or RNA-seq.
55. A computer program for predicting whether a subject will develop acute symptoms of disease after exposure, or possible exposure, to a respiratory virus, which comprises instructions which, when the program is executed by a computer, cause the computer to generate a biomarker as defined in any one of claims 1 to 31.
56. A computer program according to claim 55, wherein the computer program compares the biomarker to a reference for the biomarker.
57. A computer program according to claim 55 or claim 56, wherein the computer program compares the biomarker to a baseline for the biomarker.
58. A classification algorithm for predicting whether a subject will develop acute symptoms of disease after exposure, or possible exposine, to a respiratory virus, wherein the classification algorithm is derived by analysing expression levels of one or more genes in subjects who have developed acute symptoms of disease and comparing with the expression levels in subjects who do not develop acute symptoms of disease, wherein the one or more genes are PHF20, ABCA1, APBA2, MORC2, SNU13 DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
59. A classification algorithm according to claim 58, wherein the acute symptoms of disease in a subject are assessed by evaluating one or more symptoms of disease.
60. A classification algorithm according to claim 58 or 59, wherein the classification algorithm is computer-implemented and comprises receiving in a computer a data set comprising expression levels of the one or more genes from one or more subjects and executing on the computer software to predict whether the one or more subjects will develop acute symptoms of disease.
61. A computer readable medium and/or computer program comprising instractions which, when executed by a computer, cause the computer to carry out the classification algorithm according to any one of claims 58 to 60.
62. A computer-implemented method for predicting whether a subject will develop acute symptoms of disease wherein a biomarker is generated by analysing expression levels of one or more genes in subjects who have developed acute symptoms of disease following inoculation with a respiratory virus and comparing with the expression levels in subjects who do not develop acute symptoms of disease following inoculation with a respiratory virus, wherein the one or more genes are PHF20, ABCA1, APBA2, MORC2, SNU13 DCUN1D2, MAX, NOL9, MPRIP, HP, BST1, TM9SF2, HOMER3, NSUN6, EPHA4 and BMP2K.
63. A computer implemented method according to claim 62, wherein the method comprises a graphical user interface which displays the biomarker to the user.
EP22731779.9A 2021-06-02 2022-06-01 Biomarkers and methods for classifying subjects following viral exposure Pending EP4347891A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GBGB2107883.7A GB202107883D0 (en) 2021-06-02 2021-06-02 Biomarkers and method for classifying subjects following viral exposure
PCT/GB2022/051405 WO2022254221A1 (en) 2021-06-02 2022-06-01 Biomarkers and methods for classifying subjects following viral exposure

Publications (1)

Publication Number Publication Date
EP4347891A1 true EP4347891A1 (en) 2024-04-10

Family

ID=76741435

Family Applications (1)

Application Number Title Priority Date Filing Date
EP22731779.9A Pending EP4347891A1 (en) 2021-06-02 2022-06-01 Biomarkers and methods for classifying subjects following viral exposure

Country Status (6)

Country Link
EP (1) EP4347891A1 (en)
JP (1) JP2024526045A (en)
CN (1) CN117480262A (en)
CA (1) CA3220934A1 (en)
GB (1) GB202107883D0 (en)
WO (1) WO2022254221A1 (en)

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6040138A (en) 1995-09-15 2000-03-21 Affymetrix, Inc. Expression monitoring by hybridization to high density oligonucleotide arrays
EP1373569A4 (en) 2001-03-02 2006-02-08 Univ Pittsburgh Pcr method
US20130165470A1 (en) * 2011-12-21 2013-06-27 The Procter & Gamble Company Methods for Detecting and Treating Rhinovirus Infection
CA2989199A1 (en) * 2015-07-01 2017-01-05 Duke University Methods to diagnose and treat acute respiratory infections
DK3356558T3 (en) * 2015-09-30 2022-04-25 Immunexpress Pty Ltd SIRS PATHOGENBIOM MARKERS AND USES THEREOF
EP3504344A1 (en) * 2016-08-24 2019-07-03 Immunexpress Pty Ltd Systemic inflammatory and pathogen biomarkers and uses therefor

Also Published As

Publication number Publication date
WO2022254221A1 (en) 2022-12-08
JP2024526045A (en) 2024-07-17
CN117480262A (en) 2024-01-30
GB202107883D0 (en) 2021-07-14
CA3220934A1 (en) 2022-12-08

Similar Documents

Publication Publication Date Title
Bartoletti et al. Development and validation of a prediction model for severe respiratory failure in hospitalized patients with SARS-CoV-2 infection: a multicentre cohort study (PREDI-CO study)
Suarez et al. Superiority of transcriptional profiling over procalcitonin for distinguishing bacterial from viral lower respiratory tract infections in hospitalized adults
Tabone et al. Blood transcriptomics reveal the evolution and resolution of the immune response in tuberculosis
Sampson et al. A four-biomarker blood signature discriminates systemic inflammation due to viral infection versus other etiologies
EP3316875B1 (en) Methods to diagnose acute respiratory infections
US8821876B2 (en) Methods of identifying infectious disease and assays for identifying infectious disease
Shojaei et al. IFI27 transcription is an early predictor for COVID-19 outcomes, a multi-cohort observational study
Stein et al. Performance characteristics of clinical diagnosis, a clinical decision rule, and a rapid influenza test in the detection of influenza infection in a community sample of adults
Yu et al. Host gene expression in nose and blood for the diagnosis of viral respiratory infection
Kaforou et al. Transcriptomics for child and adolescent tuberculosis
Sintchenko et al. Treat or test first? Decision analysis of empirical antiviral treatment of influenza virus infection versus treatment based on rapid test results
Bruminhent et al. Clinical characteristics and risk factors for coronavirus disease 2019 (COVID-19) among patients under investigation in Thailand
US20230348980A1 (en) Systems and methods of detecting a risk of alzheimer&#39;s disease using a circulating-free mrna profiling assay
Redei et al. Blood transcriptomic markers for major depression: from animal models to clinical settings
WO2018154075A1 (en) Methods for classifying subjects exposed to viral infection
Chen et al. Factors associated with viral rebound among COVID-19 patients receiving oral antivirals
WO2022235518A9 (en) Method for diagnosing active tuberculosis and progression to active tuberculosis
EP4347891A1 (en) Biomarkers and methods for classifying subjects following viral exposure
Paparazzo et al. Thymic function and survival at advance ages in nursing home residents from Southern Italy
JP2021517892A (en) Precision Medicine for Pain: Diagnostic Biomarkers, Pharmacological Genomics, and Lipapas Drugs
Zhuang et al. Can the nucleic acid Ct value of discharged patients infected with SARS-CoV-2 Omicron variant be 35?——A retrospective study on fluctuation of nucleic acid Ct values in SNIEC mobile cabin hospital
Hoang et al. Investigation of target sequencing of SARS-CoV-2 and immunogenic GWAS profiling in host cells of COVID-19 in Vietnam
Sahoo et al. An AI-guided signature reveals the nature of the shared proximal pathways of host immune response in MIS-C and Kawasaki disease
WO2021252864A1 (en) Methods to detect and treat sars-cov-2 (covid19) infection
Sahajpal et al. Making a difference: adaptation of the clinical laboratory in response to the rapidly evolving COVID-19 pandemic

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: UNKNOWN

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20231114

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

RAP3 Party data changed (applicant data changed or rights of an application transferred)

Owner name: POOLBEG PHARMA (UK) LIMITED

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)