WO2009114292A1 - Copd biomarker signatures - Google Patents
Copd biomarker signatures Download PDFInfo
- Publication number
- WO2009114292A1 WO2009114292A1 PCT/US2009/035376 US2009035376W WO2009114292A1 WO 2009114292 A1 WO2009114292 A1 WO 2009114292A1 US 2009035376 W US2009035376 W US 2009035376W WO 2009114292 A1 WO2009114292 A1 WO 2009114292A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- copd
- classification
- protein
- markers
- tnf
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/20—Supervised data analysis
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2800/00—Detection or diagnosis of diseases
- G01N2800/12—Pulmonary diseases
- G01N2800/122—Chronic or obstructive airway disorders, e.g. asthma COPD
Definitions
- the present invention relates generally to methods for the rapid detection and accurate diagnosis of chronic obstructive pulmonary disease (COPD). More specifically, it relates to biomarker signatures indicative of COPD and detecting the differential expression of one or more proteins within the biomarker signatures to classify a test sample.
- COPD chronic obstructive pulmonary disease
- COPD chronic obstructive pulmonary disease
- the gold standard diagnostic parameter of COPD is declining FEVl (forced expiratory volume in 1 second) measured by spirometry (Rennard, 1998, Chest 113:235S-241S). In normal non-smokers and smokers, a slow but progressive decline in FEVl is observed ( ⁇ 20 ml/year), hi about 15% of smokers, lung function declines more rapidly than normal, leading to an accelerated limitation in physical performance (poor exercise tolerance) and dyspnea (Rennard, 1998, supra). The rate of decline between COPD patients can vary greatly.
- U.S. Patent Application Publication No. US 2006/0211026 discloses methods for diagnosing emphysema and COPD and assessing the efficacy of therapeutic drug candidates by determining if the relative level of a biomarker in a biological sample is higher or lower than the expected level, wherein the biomarker is selected from the group consisting of specific SpB, desmosine, VEGF, IGFBP2, MMP12, TIMPl, MMP9, Crabp2, Rbpl, Cyp26al, Tgm2, Timp3, Adaml7, Serpinal, Slpi, CoIIaIl 5 EIn, TGF ⁇ l, TGF ⁇ -RII, Sftpal, Sftpb, Csf2, Cxcll, Cxcl2, Cxcl5, IL-8R ⁇ , IL-8R ⁇ , IL- 6, TNF, EGF-R, Areg, PDGF ⁇ , Hp
- PCT Application Publication No. WO 2006/118522 discloses the identification and use of peptides derived from zinc- ⁇ 2-glycoprotein, ⁇ l -antitrypsin, collagen type III, prostaglandin-H2 D isomerase, collagen type I, ⁇ l -microglobulin, FGF, osteopontin, ⁇ l-acid glycoprotein 2 and fibrinogen alpha-E chain as biomarkers for COPD.
- PCT Application Publication No. WO 2008/003066 discloses methods of identifying COPD markers from informative content repositories and the use of said markers to diagnose COPD, including assessing disease progression among rapid and slow decline conditions.
- COPD chronic obstructive pulmonary disease
- the multi-analyte panels of protein biomarkers found in plasma samples of patients diagnosed with COPD help to support a positive or negative diagnosis of COPD, as well as to classify the severity of a positive COPD diagnosis (e.g., rapidly declining COPD and slowly declining COPD).
- a positive COPD diagnosis e.g., rapidly declining COPD and slowly declining COPD.
- greater than 90% accuracy in making a correct diagnosis and/or classification is provided by the disclosed methods.
- Circulating proteins are identified and described herein that are differentially expressed in COPD patients.
- the circulating plasma markers include apolipoprotein H 5 CD40, haptoglobin, interleukin-8 ("IL-8”), monocyte chemoattractant protein- 1 ("MCP-I "), tumor necrosis factor receptor II (“TNF-RII”), apolipoprotein CIII, granulocyte-macrophage colony stimulating factor (“GM-CSF”), immunoglobulin A ("IgA”), macrophage inflammatory protein 1 -alpha (“MIP-I ⁇ "), tissue factor, tumor necrosis factor-alpha (“TNF- ⁇ ”), alpha- 1 antitrypsin, C- react ⁇ ve protein (“CRP”), fibrinogen, interleukin-4 (“IL-4"), macrophage-derived chemokine (“MDC”), and soluble vascular cell adhesion molecule 1 (“sVCAM-1”).
- apolipoprotein H 5 CD40 haptoglobin, inter
- biomarker signatures comprising a subset of the plasma markers listed above are disclosed which enable one to distinguish between (a) rapidly declining COPD patients and controls (Signature 1), (b) slowly declining COPD patients and controls (Signature 2), and (c) rapidly declining COPD patients and slowly declining COPD patients (Signature 3).
- the present invention provides methods of detecting differentialprotein expression indicative of COPD in a test sample, wherein said test sample includes but is not limited to blood or a blood derivative (e.g. , plasma).
- the detection of circulating levels of proteins identified herein can classify patients as having COPD (diagnosis), as well as classify patients diagnosed with COPD according to the severity of the disease (disease monitoring). Such classification can be used in prediction of a response to a therapeutic to aid in the development of successful therapies to treat COPD and to help provide an early assessment of drug efficacy in clinical trials (i.e., following therapeutic regimens in patients).
- measurement of expression of the disclosed biomarkers can be determined after a patient has been exposed to a therapy, which may include, for example, drug therapy, combination drug therapy and non-pharmacologic intervention. Evaluation of the biomarker signatures disclosed herein, or a subset of biomarkers thereof, provides a level of discrimination not found with individual markers.
- the expression profile is determined by measurement of protein concentrations or amounts.
- the present invention provides for a computer-implemented method of classifying a test sample obtained from a mammalian subject, comprising (a) obtaining a dataset associated with said test sample, wherein said obtained dataset (i.e., test dataset) comprises quantitative data for at least three protein markers selected from a multi- analyte panel selected from the group consisting of: (i) apolipoprotein H, CD40, haptoglobin, interleukin-8 (IL-8), monocyte chemoattractant protein- 1 (MCP-I) and tumor necrosis factor receptor II (TNF-RII); (ii) apolipoprotein CIII, CD40, granulocyte-macrophage colony stimulating factor (GM-CSF), haptoglobin, immunoglobulin A (IgA), macrophage inflammatory protein lalpha (MIP-Ia), tissue factor and tumor necrosis factor-alpha (TNF- ⁇ ); and, (iii) al ⁇ ha-1 antitrypsin
- the present invention relates to a method for classifying a test sample obtained from a human subject, comprising: (a) obtaining a dataset associated with said test sample, wherein said obtained dataset comprises quantitative data for at least three protein markers selected from a multi-analyte panel selected from the group consisting of: (i) apolipoprotein H, CD40, haptoglobin, interleukin-8 (IL-8), monocyte chemoattractant protein- 1 (MCP-I) and tumor necrosis factor receptor II (TNF-RII); (ii) apolipoprotein CIII, CD40, granulocyte-macrophage colony stimulating factor (GM-CSF), haptoglobin, immunoglobulin A (IgA) 5 macrophage inflammatory protein 1 alpha (MIP- l ⁇ ), tissue factor and tumor necrosis factor-alpha (TNF- ⁇ ); and, (iii) alpha- 1 antitrypsin, C-reactive protein (CRP), fibr
- Methods of analysis disclosed herein include, without limitation, utilizing one or more reference datasets to generate a predictive model.
- Test sample data is compared to the predictive model in order to classify the sample, wherein said classification is selected from the group consisting of a COPD classification and a healthy classification.
- a COPD classification can represent a rapidly declining COPD classification or a slowing declining COPD classification.
- a predictive model of the invention utilizes quantitative data of one or more sets of markers described herein obtained from a reference population.
- the quantitative data represents protein concentrations or relative amounts of the protein biomarkers (e.g., as measured via a suitable detection method) within the disclosed biomarker signatures, or a subset of protein biomarkers thereof, in blood or a blood derivative (e.g., plasma).
- a predictive model can provide for a level of accuracy in classification wherein the model satisfies a desired quality threshold.
- a quality threshold of interest may provide for an accuracy of a given threshold and may be referred to herein as a quality metric.
- a predictive model may provide a quality metric, e.g. accuracy of classification, of at least about 70%, at least about 80%, at least about 90%, or higher. Within such a model, parameters may be appropriately selected so as to provide for a desired balance of sensitivity and selectivity.
- such a predictive model can be used in methods to classify a test sample obtained from a mammalian subject as being derived from a healthy individual or an individual with COPD, in particular rapidly declining COPD.
- a first step in said method is to obtain a dataset associated with a blood or a blood derivative (e.g., plasma) sample, wherein the dataset comprises quantitative data for at least three, or at least four, or at least five, or all six protein markers selected from the group consisting of apolipoprotein H, CD40, haptoglobin, interleukin-8 (IL- 8), monocyte chemoattractant protein- 1 (MCP-I) and tumor necrosis factor receptor II (TNF-RII).
- test sample dataset is then compared to a reference dataset containing quantitative data from the identical group of protein markers obtained from one or more reference samples used to generate the predictive model.
- both the test dataset and the reference dataset(s) used to generate the predictive model comprise quantitative data for at least the three plasma markers apolipoprotein H, MCP-I and TNF-RII.
- the quantitative data is a measurement of protein concentration.
- a predictive model is used in classifying a test sample obtained from a mammalian subject by obtaining a dataset associated with a blood or blood derivative (e.g., plasma) sample, wherein the dataset comprises quantitative data for at least three, or at least four, or at least five., or at least six, or at least seven, or all eight protein markers selected from the group consisting of apolipoprotein CIII, CD40, haptoglobin, granulocyte- macrophage colony stimulating factor (GM-CSF), immunoglobulin A (IgA), macrophage inflammatory protein 1 alpha (MIP- l ⁇ ), tissue factor and tumor necrosis factor-alpha (TNF- ⁇ ).
- a blood or blood derivative e.g., plasma
- the dataset comprises quantitative data for at least three, or at least four, or at least five., or at least six, or at least seven, or all eight protein markers selected from the group consisting of apolipoprotein CIII, CD40, haptoglobin, granulocyte-
- the test sample dataset is compared to an identical dataset (i.e., a reference dataset comprised of quantitative data from the identical group of protein biomarkers) obtained from one or more reference samples used to generate the predictive model.
- a reference dataset comprised of quantitative data from the identical group of protein biomarkers
- This method will classify the sample as being derived from a healthy individual or an individual with COPD, in particular slowly declining COPD.
- both the test dataset and the reference dataset(s) used to generate the predictive model comprise quantitative data for at least the three plasma markers IgA, MIP- l ⁇ and tissue factor.
- the quantitative data is a measurement of protein concentration.
- a predictive model is used in classifying a test sample obtained from a mammalian subject as being derived from an individual with a certain severity of COPD, particularly rapidly declining COPD or slowly declining COPD.
- a first step in said method is obtaining a dataset associated with a blood or blood derivative (e.g., plasma) sample, wherein the dataset comprises quantitative data for at least three, or at least four, or at least five, or at least six, or at least seven, or at least eight, or all nine protein markers selected from the group consisting of alpha- 1 antitrypsin, C-reactive protein (CRP), fibrinogen, granulocyte- macrophage colony stimulating factor (GM-CSF), interleukin-4 (IL-4), macrophage-derived chemokine (MDC), tissue factor, tumor necrosis factor receptor II (TNFRII) and soluble vascular cell adhesion molecule 1 (sVCAM-1).
- CRP C-reactive protein
- GM-CSF granulocyte- macrophage colony stimulating
- test sample dataset is compared to an identical dataset (i.e., a. reference dataset comprised of quantitative data from the identical group of protein biomarkers) obtained from one or more reference samples used to generate the predictive model.
- dataset comprises quantitative data for at least the three plasma markers MDC, tissue factor and sVCAM-1.
- the quantitative data is a measurement of protein concentration.
- Figure 1 illustrates four examples of plasma markers, (A) eotaxin, (B) IL-4, (C)
- MCP-I and (D) sVCAM-1 present at significantly different levels between COPD rapid decliners ("+”) and COPD slow decliners ("0") and/or healthy subjects ("•”) as determined by univariate analysis.
- Figure 2 illustrates a graphical output from a linear discriminant analysis (LDA) using different signatures, (A) COPD rapid decliners and healthy controls, (B) COPD slow decliners and healthy controls, and (C) COPD rapid and slow decliners ("*", control (CTL); "+”, COPD or COPD rapid; "0", COPD slow).
- LDA analysis measures the distance from each point in the data set to each group's multivariate mean and classifies the point to the closest group.
- the distance measure used is the Mahalanobis distance, which takes into account the variance and covariance between the variables.
- Each multivariate mean is a labeled circle. The size of the circle corresponds to a 95% confidence limit for the mean. Groups that are significantly different tend to have non-intersecting circles.
- COPD is a disease that slowly progresses over decades with significant limitations in lung function appearing only at a late stage of the disease, early detection of biomarkers capable of 1) helping with the diagnosis of COPD, 2) predicting the rate of lung function decline, and 3) monitoring drug efficacy in a timely manner in clinical trials would be very useful.
- biomarkers capable of 1) helping with the diagnosis of COPD, 2) predicting the rate of lung function decline, and 3) monitoring drug efficacy in a timely manner in clinical trials would be very useful.
- LDA Linear Discriminant Analysis
- COPD biomarkers detectable in plasma are preferable since blood is easily obtained and more reliable than lung secretions contained in bronchoalveolar lavage or induced sputum.
- COPD results from a general defect in the regulation of inflammation that manifests in the lung irritated by smoke or an uncontrolled inflammatory process that specifically takes place in the lung with systemic overflow.
- peripheral blood represents a convenient and rich source of information on COPD progression.
- Signature 1 is a multi-analyte panel composed of the following 6 protein markers: apolipoprotein H, CD40, haptoglobin, interleukin-8 (IL-8), monocyte chemoattractant protein- 1 (MCP-I) and tumor necrosis factor receptor II (TNF-RII).
- Signature 1 can correctly identify a plasma sample from a COPD rapid decliner in 93% of the cases (sensitivity) and a plasma sample from a healthy subject in 86% of the cases (specificity), for an overall accuracy in distinguishing between the two groups of 90% (see Table 6 in Example 3, infra).
- Signature 2 is a multi-analyte panel composed of the following 8 protein markers: apolipoprotein CIII, CD40, granulocyte-macrophage colony stimulating factor (GM-CSF), haptoglobin, immunoglobulin A (IgA), macrophage inflammatory protein 1 alpha (MIP- lot), tissue factor and tumor necrosis factor-alpha (TNF- ⁇ ) (see Table 6 in Example 3, infra).
- Signature 2 can correctly identify a plasma sample from a COPD slow decliner (versus a healthy subject) with a 91% sensitivity and a 96% specificity, for an overall accuracy of 94%.
- Signature 3 is a multi-analyte panel composed of the following 9 protein markers: alpha- 1 antitrypsin, C-reactive protein (CRP), fibrinogen, granulocyte-macrophage colony stimulating factor (GM-CSF) 5 interleukin-4 (IL-4), macrophage-derived chemokine (MDC), tissue factor, tumor necrosis factor receptor II (TNF-RII) and vascular cell adhesion molecule 1 (sVCAM-1).
- CRP C-reactive protein
- fibrinogen fibrinogen
- GM-CSF granulocyte-macrophage colony stimulating factor
- IL-4 interleukin-4
- MDC macrophage-derived chemokine
- tissue factor tissue factor
- TNF-RII tumor necrosis factor receptor II
- sVCAM-1 vascular cell adhesion molecule 1
- Signature 3 can correctly identify that a plasma sample is from a COPD rapid decliner in 95% of the cases (sensitivity) and that a plasma sample is from a COPD slow decliner in 86% of the cases (specificity), for an overall accuracy of distinguishing between the two groups of 92% (see Table 6 in Example 3, infra).
- Table 1 provides further information about the proteins identified in Signatures 1- 3 disclosed herein.
- the listed accession numbers correspond to entries within the National Center for Biotechnology Information (NCBI) database maintained by the National Institutes of Health.
- Pre-translationally modified forms include allelic variants, splice variants and RNA editing forms.
- Post- translationally modified forms include forms resulting from proteolytic cleavage (e.g., cleavage of a signal sequence or fragments of a parent protein), glycosylation, phosphorylation, lipidation, oxidation, methylation, cysteinylation, sulphonation and acetylation.
- the invention also contemplates the detection in a test sample of naturally occurring variants that are at least 90%, or at least 95%, or at least 97%, identical to the exemplified biomarker sequences (either nucleotide or protein sequences) listed in Table 1.
- Said biomarker variants shall have utility for the methods of the present invention and shall be detected via methods, as disclosed herein, used to detect the original biomarkers listed in column 1 of Table 1 ⁇ e.g., cross-reactivity with antibodies specific to the protein biomarkers listed in Table 1).
- These variants include but are not limited to polymorphisms, splice variants and mutations.
- percent identity in the context of two or more nucleic acid or polypeptide sequences, refers to two or more sequences or subsequences that have a specified percentage of nucleotides or amino acid residues that are the same, when compared and aligned for maximum correspondence, as measured using a sequence comparison algorithm ⁇ e.g., BLASTP and BLASTN or other algorithms available to persons of skill) or by visual inspection. Depending on the application, the percent identity can exist over a region of the sequence being compared ⁇ e.g., over a functional domain) or, alternatively, exists over the fall length of the two sequences to be compared. For sequence comparison, typically one sequence acts as a reference sequence to which test sequences are compared.
- test and reference sequences are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated.
- sequence comparison algorithm calculates the percent sequence identity for the test sequence(s) relative to the reference sequence.
- Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith and Waterman (1981, Adv. Appl. Math. 2:482), by the homology alignment algorithm of Needleman and Wunsch (1970, J MoI Biol 48:443). by the search for similarity method of Pearson and Lipman (1988, Proc. Natl Acad.
- a protein biomarker is characterized by molecular weight and/or its known protein identity. Protein biomarkers can be resolved from other proteins in a sample by using a variety of fractionation techniques, e.g., chromatographic separation coupled with mass spectrometry, protein capture using immobilized antibodies and traditional immunoassays. Detection paradigms that can be employed to this end include optical methods, electrochemical methods (voltametry and amperometry techniques), atomic force microscopy, and radio frequency methods, e.g., multipolar resonance spectroscopy.
- Illustrative of optical methods in addition to microscopy, both confocal and non-confocal, are detection of fluorescence, luminescence, chemiluminescence, absorbance, reflectance, transmittance, and birefringence or refractive index (e.g., surface plasmon resonance, ellipsometry, a resonant mirror method, a grating coupler waveguide method and interferometry).
- a sample e.g. , a test sample as described herein
- the ability to differentiate between both different proteins and different forms of the same protein depends upon the nature of the differences between the proteins and the method of detection used.
- an immunoassay using a monoclonal antibody will detect all forms of a protein containing the eptiope and will not distinguish between them.
- a sandwich immunoassay that uses two antibodies directed against different epitopes on a protein will detect all forms of the protein that contain both epitopes and will not detect those forms that contain only one of the epitopes.
- a particular form (or a subset of particular forms) of a protein is a better biomarker than the collection of different forms detected together by a particular method, the power of the assay may suffer.
- it may be useful to employ an assay method that distinguishes between forms of a protein and that specifically detects and measures a desired form or forms of the protein. Distinguishing different forms of a protein analyte or specifically detecting a particular form of a protein analyte is referred to as "resolving" the analyte.
- blood samples or samples derived from blood (e.g. plasma, serum) are assayed for the presence of one or more of the protein markers disclosed as members of the multi-analyte biomarker signatures described herein.
- a blood sample is drawn, and a derivative product, such as plasma or serum, is tested.
- protein biomarkers may be detected through the use of specific binding members. For example, the use of antibodies for this purpose is of particular interest.
- Various formats find use for such assays, including the following: antibody arrays; enzyme-linked immunosorbent assays (ELISA) and radioimmunoassay (RIA) formats; binding of labeled antibodies in suspension/solution and detection by a method which includes, but is not limited to, flow cytometry and mass spectroscopy.
- Detection may utilize one or a panel of antibodies, preferably a panel of antibodies in an array format.
- the biomarkers of the present invention can also be detected by mass spectrometry, a method that employs a mass spectrometer to detect gas phase ions.
- mass spectrometers are time-of-flight, magnetic sector, quadrupole filter, ion trap, ion cyclotron resonance, electrostatic sector analyzer and hybrids of these.
- the mass spectrometer can be a laser desorption/ionization (LDI) mass spectrometer.
- LIDI laser desorption/ionization
- the protein analytes are placed on the surface of a mass spectrometry probe, a device adapted to engage a probe interface of the mass spectrometer and to present an analyte to ionizing energy for ionization and introduction into a mass spectrometer.
- a laser desorption mass spectrometer employs laser energy, typically from an ultraviolet laser, but also from an infrared laser, to desorb analytes from a surface, to volatilize and ionize them and make them available to the ion optics of the mass spectrometer.
- the analyis of proteins by LDI can take the form of MALDI or of SELDI (see U.S. Publication No. US20070172902 in the name of Zhang et al).
- Mass spectrometry is also a particularly powerful methodology to resolve different forms of a protein because the different forms typically have different masses that can be resolved by the technique.
- mass spectrometry may be able to specifically detect and measure the useful form where a traditional immunoassay fails both to distinguish the forms and to specifically detect the useful biomarker.
- Mass spectrometry can also be combined with an immunoassay.
- a biospecific capture reagent e.g., an antibody that recognizes the biomarker and forms of it
- the biospecific capture reagent is bound to a solid phase, such as a bead, a plate, a membrane or an array.
- a solid phase such as a bead, a plate, a membrane or an array.
- mass spectrometry Various forms of mass spectrometry are useful for detecting the protein forms, as described above, including laser desorption approaches, such as traditional MALDI or SELDI, and electrospray ionization.
- Biochips generally comprise solid substrates and have a generally planar surface to which a capture reagent (also called an adsorbent or affinity reagent) is attached. Frequently, the surface of a biochip comprises a plurality of addressable locations, each of which has the capture reagent bound there.
- a capture reagent also called an adsorbent or affinity reagent
- Protein biochips are biochips adapted for the capture of polypeptides. Many protein biochips are described in the art. These include, for example, protein biochips produced by Ciphergen Biosystems, Inc.
- the present invention discloses methods used for rapid detection and/or accurate diagnosis of chronic obstructive pulmonary disease (COPD) in a subject, classifying and/or identifying samples derived from a subject diagnosed with COPD according to the severity of the disease, and identifying and assessing the extent of COPD progression in a subject (disease monitoring/staging).
- Classifying and/or identifying test samples as being derived from healthy controls, rapidly declining COPD patients or slowly declining COPD patients can also be used to predict and/or monitor a response to a therapeutic regimen, including but not limited to monitoring drug efficacy during a clinical trial.
- the present invention includes methods of evaluating the efficacy of therapeutic agents and therapeutic regimens; disease staging and classification; and the like. Early detection can be used to determine the occurrence of developing COPD, thereby allowing for intervention with appropriate preventive or protective measures.
- test sample in methods of classifying and/or identifying a test sample as being obtained from a subject with COPD (either slowing declining or rapidly declining COPD) 5 diagnosing a patient with COPD, or classifying the severity of COPD in a patient
- the expression pattern in a test sample including but not limited to blood, serum and plasma, of one or more of the protein markers provided herein is obtained and compared to control values to determine a diagnosis/classification.
- a blood-derived sample may be applied to a specific binding agent (e.g. , antibody) or panel of specific binding agents to determine the presence of the markers of interest and/or quantify the markers within the sample.
- the analysis will generally include detecting and/or quantifying at least one of the markers described herein, e.g., apolipoprotein H, CD40, haptoglobin, interleukin-8 (“IL-8”), monocyte chemoattractant protein- 1 (“MCP-I”), tumor necrosis factor receptor II (“TNF-RII”), apolipoprotein CIII, granulocyte-macrophage colony stimulating factor (“GM-CSF”), immunoglobulin A ("IgA”), macrophage inflammatory protein 1 alpha (“MIP-Ia”), tissue factor, tumor necrosis factor-alpha (“TNF- ⁇ ”), alpha- 1 antitrypsin, C-reactive protein ( 11 CRP”), fibrinogen, interleukin-4 (IL-4), macrophage-derived chemokine (“MDC”), and soluble vascular cell adhesion molecule 1 (“sVCAM-1 "); usually at least two of the markers, more usually at least three of the markers, and may include 4, 5, 6,
- a preferred set of markers comprises at least three of the following multi-analyte panel: apolipoprotein H, CD40, haptoglobin, IL-8, MCP-I and TNF-RII; and may include, 4, 5 or all 6 of them.
- This multi-analyte panel represents Signature 1 as described herein.
- the preferred at least three markers to quantify for this method are apoliprotein H, MCP-I and TNF-RII, providing an overall accuracy of correctly identifying a plasma sample from a COPD rapid decliner versus a healthy subject of approximately 89% (see Table 6, infra). The degree of accuracy is increased to approximately 90% by assessing quantitative data for all 6 of the listed biomarkers within Signature 1 (see Table 6, infra).
- a preferred set of markers comprises at least three of the following mult ⁇ -analyte panel: apolipoprotein CIII, CD40, haptoglobin, GM-CSF, IgA, MIP- l ⁇ , tissue factor and TNF- ⁇ ; and may include 4, 5, 6, 7 or all 8 of them.
- This multi-analyte panel represents Signature 2 as described herein.
- the preferred at least three markers to quantify for this method are IgA, MIP-I ⁇ and tissue factor, providing an overall accuracy of correctly identifying a plasma sample from a COPD slow decliner versus a healthy subject of approximately 76% (see Table 6, infra).
- the degree of accuracy is increased to approximately 78% by further quantifying GM-CSF (for a total of 4 biomarkers) and increased to approximately 83% by further quantifying GM-CSF and apolipoprotein CIII (for a total of 5 biomarkers).
- the degree of accuracy is increased to approximately 94% by assessing quantitative data for all 8 of the listed biomarkers (see Table 6, infra).
- a preferred set of markers comprises at least three of the following multi-analyte panel: alpha-1 antitrypsin, CRP, fibrinogen, GM-CSF, IL-4, MDC, tissue factor, TNF-RJI and sVCAM- 1 ; and may include 4, 5, 6, 7, 8 or all 9 of them.
- This multi-analyte panel represents Signature 3 as described herein.
- the preferred at least three markers to quantify for this method are MDC, tissue factor and sVCAM-1, providing an overall accuracy of correctly distinguishing a plasma sample derived from a COPD rapid decliner as opposed to a plasma sample derived from a COPD slow decliner of approximately 80% (see Table 6, infra).
- the degree of accuracy is increased to approximately 85% by further quantifying IL-4 (for a total of 4 biomarkers), and the degree of accuracy is increased to approximately 92% by assessing quantitative data for all 9 of the listed biomarkers.
- an individual test dataset When staging the severity of COPD (i.e., rapid decliner versus slow decliner), an individual test dataset will be compared against one or more reference datasets obtained from disease samples of a known stage, constructing a model that predicts stage and inputting a dataset in that model to obtain a predicted staging.
- the present invention provides for computer-implemented methods for classifying a test sample obtained from a mammalian subject, comprising (a) obtaining a dataset associated with said sample, wherein said obtained dataset (i.e., test dataset) comprises quantitative data for at least three protein markers selected from a multi-analyte panel selected from the group consisting of: (i) apolipoprotein H, CD40, haptoglobin, interleukin-8 (IL-8), monocyte chemoattractant protein- 1 (MCP-I) and tumor necrosis factor receptor II (TNF-SRJI); (ii) apolipoprotein CHI, CD40, granulocyte-macrophage colony stimulating factor (GM-CSF), interleukin-4 (IL-4), haptoglobin, immunoglobulin A (IgA), macrophage inflammatory protein 1 alpha (MIP- lot), tissue factor and tumor necrosis factor-alpha (TNF- ⁇ ); and, (ii)
- the present invention further provides for methods for classifying a test sample obtained from a human subject, comprising: (a) obtaining a dataset associated with said test sample, wherein said obtained dataset comprises quantitative data for at least three protein markers selected from a multi-analyte panel selected from the group consisting of: (i) apolipoprotein H, CD40, haptoglobin, interleukin-8 (IL-8), monocyte chemoattractant protein-1 (MCP-I) and tumor necrosis factor receptor II (TNF-RIT); (ii) apolipoprotein CIII, CD40, granulocyte-macrophage colony stimulating factor (GM-CSF), haptoglobin, immunoglobulin A (IgA), macrophage inflammatory protein 1 alpha (MIP- l ⁇ ), tissue factor and tumor necrosis factor-alpha (TNF- ⁇ ); and, (iii) alpha- 1 antitrypsin, C-reactive protein (CRP), fibrinogen, granulocyte-ma
- the classification methods described herein may be used to identify a subject (e.g., a human patient) for diagnosis or disease staging purposes in order to accurately develop a course of treatment most suitable for said individual.
- the classification methods may also be used to help calculate and assess efficacy of a drug for treating COPD during a clinical trial.
- this information can be transformed, for example, to generate a more effective treatment plan to limit further development of COPD in said subject or to determine efficacy of a drug candidate for use in COPD treatment.
- the methods disclosed herein can be practiced with the determination of the concentration in a test plasma sample of at least a subset of the three biomarkers signatures described herein. In certain embodiments, greater than 90% accuracy in making a correct diagnosis is provided by the disclosed methods.
- the methods described herein may be implemented using any device capable of implementing the methods.
- devices that may be used include, but are not limited to, electronic computational devices, including computers of all types.
- the computer program that may be used to configure the computer to carry out the steps of the methods may be contained in any computer readable medium capable of containing the computer program. Examples of computer readable medium that may be used include, but are not limited to, diskettes, CD-ROMs, DVDs, ROM, RAM, and other memory and computer storage devices.
- the computer program that may be used to configure the computer to carry out the steps of the methods may also be provided over an electronic network, for example, over the internet, world wide web, an intranet, or other network.
- the methods described herein may be implemented in a system comprising a processor and a computer readable medium that includes program code means for causing the system to carry out the steps of the methods.
- the processor may be any processor capable of carrying out the operations needed for implementation of the methods.
- the program code means may be any code that when implemented in the system can cause the system to carry out the steps of the methods described herein. Examples of program code means include, but are not limited to, instructions to carry out the methods described in this application written in a high level computer language, such as C++, Java, or Fortran; instructions to carry out the methods described in this application written in a low level computer language, such as assembly language; or, instructions to carry out the methods described herein in a computer executable form, such as compiled and linked machine language.
- the quantitation of markers in a sample is determined by the methods described above and as known in the art.
- Quantitative data obtained from a test sample (“obtained dataset” or "test dataset,” as used interchangeably herein) is subjected to an analytic classification process that compares the obtained dataset to one or more reference datasets.
- the raw data including but not limited to protein concentration data for the tested biomarkers, is quantified and compared to a predictive model generated by assessing one or more reference populations using the same quantitative data as gathered from the test sample.
- the predictive model is generated using a training set of data from reference populations, wherein both the training dataset and the test (or obtained) dataset is comprised of the same quantitative information.
- a dataset associated with a test sample consisting of circulating plasma concentration data of apolipoprotein H, MCP-I and TNF-RH is compared to a predictive model generated using the same set of concentration data (i.e., circulating plasma concentration of apolipoprotein H, MCP-I and TNF-RII) obtained from both healthy individuals (reference sample 1) and rapidly declining COPD individuals (reference sample 2).
- the methods of the invention use a classifier for diagnosing or classifying COPD.
- the classifier can be based on any appropriate pattern recognition method that receives an input comprising a multi-analyte profile and provides an. output comprising data indicating to which group a test sample belongs.
- the classifer can be trained with training data from one or more reference population(s) of subjects (reference dataset(s)).
- the training data comprise, for each of the subjects in the training population, a multi-analyte profile comprising quantitative measurements of biomarker proteins in a suitable sample taken from the patient.
- An analytic classification process may use any one of a variety of statistical analytic methods to manipulate the quantitative data and provide for classification of the test sample.
- useful methods include linear discriminant analysis (LDA), recursive feature elimination, a prediction analysis of microarray, a logistic regression, a CART algorithm, a FlexTree algorithm, a LART algorithm, a random forest algorithm, a MART algorithm and machine learning algorithms.
- LDA linear discriminant analysis
- classification of interest includes, without limitation, the assignment of a test sample to one or more of the following states: i) a COPD state, including a rapidly declining COPD state or a slowly declining COPD state; and, ii) a healthy state (no disease state).
- Classifications also may be made by determining whether a comparison between an obtained dataset and a reference dataset yields a statistically significant difference. If so, then the sample from which the dataset was obtained is classified as not belonging to the reference dataset class. Conversely, if such a comparison is not statistically significant from the reference dataset, then the sample from which the dataset was obtained is classified as belonging to the reference dataset class.
- Classification can be made according to predictive modeling methods that set a threshold for determining the probability that a sample belongs to a given class.
- the probability of the methods of the present invention is at least about 70%, preferably at least about 80%, and more preferably at least about 90% or higher.
- the predictive ability of a model may be evaluated according to its ability to provide a quality metric, e.g. accuracy, of a particular value or range of values.
- a desired quality threshold has the ability to classify a test sample with an accuracy of at least about 70%, at least about 75%, at least about 80%, at least about 85%, or at least about 90% or higher.
- the term “accuracy” refers to the computed ability of an individual marker or a combination of markers to correctly identify a disease state (e.g., COPD) from a control state (e.g., healthy).
- a disease state e.g., COPD
- a control state e.g., healthy
- the relative sensitivity and specificity of a predictive model can be tuned to favor either the selectivity metric or the sensitivity metric, where the two metrics have an inverse relationship.
- sensitivity refers to the ability of an individual marker or a combination of markers to correctly identify a disease state
- specificity refers to the ability of the marker(s) to correctly identify a normal (e.g., non- diseased) state.
- sensitivity and specificity may be at least about at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, or at least about 90% or higher.
- the raw data may be initially analyzed by measuring the values (e.g. , concentration) for each marker, usually in triplicate or in multiple triplicates.
- the data may be manipulated, for example, raw data may be transformed using standard curves, and the average of triplicate measurements used to calculate the average and standard deviation for each patient. These values may be transformed before being used in the models, e.g. log-transformed, Box- Cox transformed (see Box and Cox (1964) J. Royal Stat. Soc, Series B, 26:211-246).
- the data are then input into a predictive model, which will classify the sample according to the state.
- the resulting information may be transmitted to a patient, health professional or clinical research analyst.
- a robust data set comprising known control samples and/or samples corresponding to a classification of interest are used in a training set ("reference dataset").
- a sample size is selected using generally accepted criteria.
- different statistical methods can be used to obtain a highly accurate predictive model.
- An example of such analysis using linear disciminant analysis (LDA) is provided in Example 3.
- LDA Linear discriminant analysis attempts to classify a test sample or subject into one of two categories based on certain object properties. In other words, LDA tests whether object attributes measured in an experiment predict categorization of the objects. LDA typically requires continuous independent variables and a dichotomous categorical dependent variable.
- the quantitative values e.g., protein concentration data
- the clinical group classification of each member of the training population serves as the dichotomous categorical dependent variable.
- LDA seeks the linear combination of variables that maximizes the ratio of between-group variance and within-group variance by using the grouping information. Implicitly, the linear weights used by LDA depend on how the quantitative value (e.g. , protein concentration) of a biomarker across the training set separates in the two groups (e.g., a COPD control group and a healthy control group).
- LDA is applied to the data matrix of the N members in the training sample by K biomarkers in a combination of biomarkers described in the present invention. Then, the linear discriminant of each member of the training population is plotted. Ideally, those members of the training population representing a first control subgroup (e.g.
- Quadratic discriminant analysis takes the same input parameters and returns the same results as LDA.
- QDA uses quadratic equations, rather than linear equations, to produce results.
- LDA and QDA are interchangeable, and which to use is a matter of preference and/or availability of software to support the analysis.
- Logistic regression takes the same input parameters and returns the same results as LDA and QDA.
- hierarchical clustering is performed in the derivation of a predictive model, where the Pearson correlation is employed as the clustering metric.
- One approach is to consider a COPD dataset as a "learning sample” in a problem of "supervised learning.”
- CART is a standard in applications to medicine (Singer (1999) Recursive Partitioning in the Health Sciences, Springer), which may be modified by transforming any qualitative features Io quantitative features; sorting them by attained significance levels, evaluated by sample reuse methods for Hotelling's T 2 statistic; and suitable application of the lasso method.
- Problems in prediction are turned into problems in regression without losing sight of prediction, indeed by making suitable use of the Gini criterion for classification in evaluating the quality of regressions.
- Logic regression resembles CART in that its classifier can be displayed as a binary tree. It is different in that each node has Boolean statements about features that are more general than the simple "and" statements produced by CART.
- the false discovery rate may be determined.
- a set of null distributions of dissimilarity values is generated.
- the values of observed profiles are permuted to create a sequence of distributions of correlation coefficients obtained out of chance, thereby creating an appropriate set of null distributions of correlation coefficients (see Tusher et al., 2001, Proc. Natl Acad. Sci. USA 98, 51 16-21; herein incorporated by reference).
- the set of null distribution is obtained by: permuting the values of each profile for all available profiles; calculating the pair- wise correlation coefficients for all profile; calculating the probability density function of the correlation coefficients for this permutation; and, repeating the procedure for N times, where N is a large number, usually about 300.
- N is a large number, usually about 300.
- the FDR is the ratio of the number of the expected falsely significant correlations
- This cut-off correlation value may be applied to the correlations between experimental profiles. Using the aforementioned distribution, a level of confidence is chosen for significance. This is used to determine the lowest value of the correlation coefficient that exceeds the result that would have been obtained by chance. Using this method, one obtains thresholds for positive correlation, negative correlation or both. Using this threshold(s), the user can filter the observed values of the pairwise correlation coefficients and eliminate those that do not exceed the threshold(s). Furthermore, an estimate of the false positive rate can be obtained for a given threshold. For each of the individual "random correlation" distributions, one can find how many observations fall outside the threshold range. This procedure provides a sequence of counts. The mean and the standard deviation of the sequence provide the average number of potential false positives and its standard deviation.
- variables chosen in the cross-sectional analysis are separately employed as predictors. Given the specific COPD outcome, the random lengths of time each patient will be observed, and selection of proteomic and other features, a parametric approach to analyzing survival may be better than the widely applied semi-parametric Cox model.
- a Weibull parametric fit of survival permits the hazard rate to be monotonically increasing, decreasing, or constant, and also has a proportional hazards representation (as does the Cox model) and an accelerated failure-time representation. All the standard tools available in obtaining approximate maximum likelihood estimators of regression coefficients and functions of them are available with this model.
- Cox models may be used, especially since reductions of numbers of covariates to manageable size with the lasso will significantly simplify the analysis, allowing the possibility of an entirely nonparametric approach to survival.
- These statistical tools are applicable to all manner of proteomic data.
- a subset of markers i.e. at least 3, at least 4, at least 5, at least 6, or up to the complete set of markers.
- a subset of markers will be chosen that provides for the needs of the quantitative sample analysis (e.g. , availability of reagents, convenience of quantitation, etc.) while maintaining a highly accurate predictive model.
- the selection of a number of informative markers for building classification models requires the definition of a performance metric and a user-defined threshold for producing a model with useful predictive ability based on this metric.
- the performance metric may be the sensitivity and/or specificity of the prediction, as well as the overall accuracy of the prediction model.
- LDA was used in a training model to identify biomarkers relevant to COPD and certain stages of the disease.
- the selection of a subset of markers may be used for a forward selection or a backward selection of a marker subset.
- a number of markers may be selected that will optimize the performance of a model without the use of all the markers (see Table 6 in Example 3, infra).
- One way to define the optimum number of terms is to choose the number of terms that produce a model with desired predictive ability (e.g., an accuracy of greater than 80%, or equivalent measures of sensitivity/specificity) that lies no more than one standard error from the maximum value obtained for this metric using any combination and number of terms used for the given analytic process.
- Reagents and Kits Also provided are reagents and kits thereof for practicing one or more of the above-described methods.
- the subject reagents and kits thereof may vary greatly.
- Reagents of interest include reagents specifically designed for use in production of the above described expression profiles of circulating protein markers associated with COPD.
- kits for practicing one or more of the above- described methods that include at least one reagent specific for a COPD biomarker described herein, wherein the COPD biomarker is selected from the group consisting of apolipoprotein H, CD40, haptoglobin, interleukin-8 (“IL-8”), monocyte chemoattractant protein- 1 (“MCP-I”), tumor necrosis factor receptor II (“TNF-RII”), apolipoprotein CIII, granulocyte-macrophage colony stimulating factor (“GM-CSF”), immunoglobulin A (“IgA”), macrophage inflammatory protein 1-al ⁇ ha (“MIP-I ⁇ "), tissue factor, tumor necrosis factor-alpha (“TNF- ⁇ ”), alpha-1 antitrypsin, C-reactive protein (“CRP”), fibrinogen, interleukin-4 (“IL-4"), macrophage-derived chemokine (“MDC”), and soluble vascular cell adhesion molecule 1 (“sVCAM)
- said kit comprises reagents for detecting at least three protein markers selected from one or more of the biomarkers signatures described herein.
- said kit may comprise reagents for detecting at least three protein markers selected from a multi-analyte panel selected from the group consisting of: (a) apolipoprotein H, CD40, haptoglobin, interleukin-8 (IL-8), monocyte chemoattractant protein- 1 (MCP-I) and tumor necrosis factor receptor II (TNF-RII);
- CRP C-reactive protein
- IL-4 interleukin-4
- MDC macrophage-derived chemokine
- sVCAM-1 soluble vascular cell adhesion molecule 1
- a kit of the present invention may include reagents that are labeled compounds or agents useful to detect a polypeptide or an mRNA encoding a polypeptide corresponding to a COPD biomarker disclosed herein in a biological sample and means for determining the amount of the polypeptide or mRNA in the sample (e.g., an antibody that binds the polypeptide or an oligonucleotide probe that binds to DNA or mRNA encoding the polypeptide).
- One type of such reagent suitable for binding with a polypeptide corresponding to a COPD biomarker is an antibody or fragment thereof (including an antibody derivative) that binds to a marker of interest.
- suitable reagents for binding with a nucleic acid include complementary nucleic acids.
- a nucleic acid e.g., genomic DNA, mRNA, spliced mRNA, cDNA
- suitable reagents for binding with a nucleic acid include complementary nucleic acids.
- array formats are known in the art, with a wide variety of different probe structures, substrate compositions and attachment technologies.
- the reagent is directly or indirectly labeled with a detectable substance.
- the expression of the one or more biomarkers is detected by: (a) detecting the expression of a polypeptide which is regulated by the one or more biomarkers; (b) detecting the expression of a polypeptide which regulates the biomarker; or, (c) detecting the expression of a metabolite of the biomarker.
- the kit can comprise, for example: (1) a first antibody (e.g., attached to a solid support) which binds a polypeptide corresponding to a biomarker of the invention; and, optionally (2) a second, different antibody that binds to either the polypeptide or the first antibody and is conjugated to a detectable label.
- a first antibody e.g., attached to a solid support
- a second, different antibody that binds to either the polypeptide or the first antibody and is conjugated to a detectable label.
- the kit can comprise, for example: (1) an oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic acid sequence encoding a polypeptide corresponding to a biomarker of the invention, or (2) a pair of primers useful for amplifying a nucleic acid molecule corresponding to a biomarker of the invention.
- an oligonucleotide e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic acid sequence encoding a polypeptide corresponding to a biomarker of the invention
- a pair of primers useful for amplifying a nucleic acid molecule corresponding to a biomarker of the invention.
- the kit can also comprise other components, such as a buffering agent, a preservative, a protein stabilizing agent, and/or components necessary for detecting the detectable label.
- the kit may include reagents employed in the various methods, such as devices for withdrawing and handling blood samples, second stage antibodies, ELISA reagents; tubes, spin columns, and the like. Each component of the kit can be enclosed within an individual container and all of the various containers can be within a single package.
- Representative array or kit compositions of interest include or consist of reagents for quantitation of at least three, at least four, at least five or all six markers selected from the group consisting of apolipoprotein H, CD40, haptoglobin, IL-8, MCP-I and TNF-RII.
- This kit can be used to classify a test sample as and/or diagnosing a subject with rapidly declining COPD, as opposed to a healthy patient.
- the preferred at least three markers to quantify using reagents included within the kit may comprise or consist of apoliprotein H, MCP-I and TNF-RIL
- a representative array or kit includes or consists of reagents for quantitation of at least three, at least four, at least five, at least six, at least seven or all eight markers selected from the group consisting of apolipoprotein CIII, CD40, haptoglobin, GM-CSF, IgA, MIP-Ia, tissue factor and TNF- ⁇ .
- This kit can be used to classify a test sample as and/or diagnosing a subject with slowly declining COPD, as opposed to a healthy patient.
- the preferred at least three markers to quantify using reagents included within the kit may comprise or consist of IgA 5 MIP- l ⁇ and tissue factor.
- a representative array or kit includes or consists of reagents for quantitation of at least three, at least four, at least five, at least six, at least seven, at least eight or all nine markers selected from the group consisting of alpha- 1 antitrypsin, CRP, fibrinogen, GM-CSF, IL-4, MDC, tissue factor, TNF-RII and sVCAM-1.
- This kit can be used to classify a test sample as and/or diagnosing a subject with rapid declining COPD, as opposed to slowly declining COPD.
- the preferred at least three markers to quantify using reagents included within the kit may comprise or consist of MDC, tissue factor and sVCAM-1.
- the kits may further include a software package for statistical analysis of one or more phenotypes, and may include a reference database(s) for calculating the probability of classification within a predictive model.
- the subject kits may further include instructions for practicing the subject methods and for interpreting the results of the assays performed using the kit.
- These instructions may be present in the subject kits in a variety of forms, one or more of which may be present in the kit.
- One form in which these instructions may be present is as printed information on a suitable medium or substrate, e.g., a piece or pieces of paper on which the information is printed, in the packaging of the kit, in a package insert, etc.
- Yet another means would be a computer readable medium, e.g., diskette, CD, etc., on which the information has been recorded.
- Yet another means that may be present is a website address which may be used via the internet to access the information at a removed site. Any convenient means may be present in the kits.
- FVC FEVl/forced vital capacity
- Frozen plasma samples were sent to Rules-Based Medicine, Inc. (Austin, TX) for analysis of markers in their proprietary Human Antigen MAP platform. Frozen plasma samples were thawed at room temperature, vortexed, spun at 13,000 x g for 5 minutes for clarification, and 40 uL was removed for markers analysis into a master microtiter plate. Using automated pipetting, an aliquot of each sample was introduced into one of the capture microsphere multiplexes of the Human Antigen MAP. These mixtures of sample and capture microspheres were thoroughly mixed and incubated at room temperature for 1 hour. Multiplexed cocktails of biotinylated, reporter antibodies for each multiplex were then added robotically and, after thorough mixing, were incubated for an additional hour at room temperature.
- Unknown values for each of the analytes localized in a specific multiplex were determined using 4 and 5 parameter, weighted and non- weighted curve fitting algorithms included in the data analysis package.
- False Discovery Rate (FDR), estimated by q- value, is the proportion of significant changes that are false positives.
- the q-value for each marker was derived using the method proposed by Benjamini & Hockberg (Benjamini and Hockberg, 2000, J. Behav. Educ. Statist. 25:60-83). All analyses were carried out using R, version 2.4 (R Development Core Team (2006). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051 -07-0, URL http://www.R-project.org.)
- Cancer antigen 19.9 immunoglobulin type A (IgA) 5 insulin, macrophage inflammatory protein 1 alpha (MIP- l ⁇ ) and soluble vascular cell adhesion molecule 1 (sVCAM-1) were present at significantly lower levels in rapid decliners compared to slow decliners.
- IgA immunoglobulin type A
- MIP- l ⁇ macrophage inflammatory protein 1 alpha
- sVCAM-1 soluble vascular cell adhesion molecule 1
- Multivariate analysis for the identification of signatures - Mult-analyte panels (signatures) that provide optimal separation between COPD rapid decliners and healthy controls, COPD slow decliners and healthy controls, and COPD rapid and COPD slow decliners were determined using a forward selection procedure with Linear Discriminant Analysis (Venables, W.N. & Ripley, B.D., (2002). Modern Applied Statistics, Fourth Edition. Springer).
- This analysis measured the distance from each point in the data set to each group's multivariate mean ⁇ called a centroid) and classified the point to the closest group.
- the distance measure sued was the Mahalanobis distance which takes into account the variances and covariances between the variables.
- Table 6 Summary of representative multi-analyte panels (signatures), along with their predictive erformance, as identified b multivariate anal sis.
- LDA Linear Discriminant Analysis
- This method identified a group of plasma markers (signature 1) capable of accurately distinguishing COPD rapid decliners from controls (healthy smokers) ( Figure 2A).
- Figure 2A is an output from a linear discriminant analysis (LDA) of a two-dimensional representation using principal components of a six-marker signature for separating COPD rapid decliners from healthy controls.
- the distance measure used is the Mahalanobis distance, which takes into account the variance and covariance between the variables.
- Each multivariate mean is a labeled circle. The size of the circle corresponds to a 95% confidence limit for the mean.
- Signature 1 is composed of the following 6 markers: apolipoprotein H, CD40, haptoglobin, interleukin-8 (IL-8), monocyte chemoattractant protein- 1 (MCP-I) and tumor necrosis factor receptor II (TNF-RJI).
- apolipoprotein H CD40
- haptoglobin IL-8
- IL-8 interleukin-8
- MCP-I monocyte chemoattractant protein- 1
- TNF-RJI tumor necrosis factor receptor II
- signature 1 can correctly identify that a plasma sample is from a COPD rapid decliner in 93% of the cases (sensitivity) and that a plasma sample is from a healthy subject in 86% of the cases (specificity) for an overall accuracy to distinguish between the two groups of approximately 90%.
- Figure 2 is an output from a linear discriminant analysis (LDA) of a two-dimensional representation using principal components of an eight-marker signature for separating COPD slow decliners from healthy controls.
- Signature 2 is composed of the following 8 markers: apolipoprotein CIII, CD40, granulocyte-macrophage colony stimulating factor (GM-CSF), haptoglobin, immunoglobulin A (IgA), macrophage inflammatory protein 1 alpha (MIP- l ⁇ ), tissue factor and tumor necrosis factor-alpha (TNF- ⁇ ).
- IgA immunoglobulin A
- MIP- l ⁇ macrophage inflammatory protein 1 alpha
- TNF- ⁇ tumor necrosis factor-alpha
- Figure 2C is an output from a linear discriminant analysis (LDA) of a two-dimensional representation using principal components of a nine-marker signature for separating COPD slow and rapid decliners.
- LDA linear discriminant analysis
- Signature 3 is composed of the following 9 markers: alpha- 1 antitrypsin, C-reactive protein (CRP), fibrinogen, granulocyte-macrophage colony stimulating factor (GM-CSF), interleukin-4 (IL-4), macrophage-derived chemokine (MDC), tissue factor, tumor necrosis factor receptor II (TNF- RII) and soluble vascular cell adhesion molecule-1 (s VC AM-I).
- CRP C-reactive protein
- GM-CSF granulocyte-macrophage colony stimulating factor
- IL-4 interleukin-4
- MDC macrophage-derived chemokine
- TNF- RII tumor necrosis factor receptor II
- s VC AM-I soluble vascular cell adhesion molecule-1
- Table 7 lists representative composites of biomarkers from LDA analysis that provide impressive levels of sensitivity and specificity.
- LDA for assessing the performance of the various composites of markers was performed using a contributed library within R (Venables, W.N. & Ripley, B.D., (2002). Modern Applied Statistics, Fourth Edition. Springer, New York).
- the 5 -fold cross-validation for obtaining reliable estimates of the performance metrics of the biomarker composites was performed using a contributed library within R (Andrea Peters and Torsten Hothorn, 2004, Improved Predictors, R package version 0.8-3).
- Table 7 Summary of the percent accuracy (“Ace”), sensitivity (“Sen”) and specificity (“Spe”) as determined by Linear Discrininant Analysis (LDA) for each of the three biomarker signatures (“optimal-LDA”) disclosed in the present application, as well as for subsets of these optimal signatures containing the best of 3, 4 and/or 5 of the biomarkers.
- Ace percent accuracy
- Sen sensitivity
- Spe specificity
- LDA Linear Discrininant Analysis
- markers showing differences between rapid decliners and controls have been previously reported to be modulated in the blood of non-exacerbated COPD patients, such as ⁇ l-antitrypsin (Aldonyte et al, 2004, COPD 1:155-164), eotaxin (Aldonyte et at, 2004, supra; Janhz-Rozyk et al., 2000, Pol. Merkur. Lekarskl 9:649-652), fibrinogen (Gan et al., 2004, Thorax 59:574-580), IL-4 (Zhang et al., 1999, J Tongji Med. Univ.
- IL-8 IL-8
- MCP-I vascular cell adhesion molecule- 1
- VCAM-1 interferes with the adhesion of VCAM-I -bearing leukocytes to endothelial cells expressing the VCAM-I ligand, ⁇ 4 ⁇ l integrin (Foster, 1996, J Allergy Clin. Immunol 98:S270 ⁇ S277).
- chemoattractants such as GM-CSF, IL-8 and MCP-I known to modulate the activity of neutrophils and macrophages, two cell types implicated in COPD (Barnes, 2004, Pharmacol Rev. 56:515-548). It is even more interesting to note in that list the presence of several mediators associated to Th2/Tc2 phenotype such as IL-4, IL-5, IL-10, IL- 13 and eotaxin.
- Chronic Obstructive Pulmonary Disease Chronic Obstructive Pulmonary Disease: Pathogenesis to Treatment: Novartis Foundation Symposium, Volume 234. Eds. Arthur Chadwick & Jamie A. Goode. Chichester: Wiley, 2001. 149-168).
- COPD like many other chronic diseases, is thought to be a highly heterogeneous disease with phenotypic expressions influenced by genetic and environmental factors. This leads to the expected intra-group variability observed for each plasma markers when performing univariate analysis. Similar variability after univariate analysis of markers in serum from COPD and controls was recently described by another group (Pinto-Plata et al., 2007, supra). In this context, each individual plasma marker has limited potential to accurately distinguish between samples from healthy individuals and patients with either slow or rapid forms of COPD. The combination of multiple markers in a multivariate analysis takes in consideration this intra-group heterogeneity. Multivariate analysis led to the identification of a 6-marker signature capable of distinguishing rapid decliners from healthy controls with about 90% accuracy.
- the signature is composed of chemoattractants for neutrophils and monocytes such as IL- 8 and MCP-I with putative roles in the inflammation that characterizes COPD (Barnes, 2004, supra); the circulating soluble form of a surface glycoprotein named CD40 previously reported to be elevated in patients with chronic renal failure (Schwabe et al., 1999, CHn. Exp. Immunol.
- sTNFRII the anti-inflammatory sTNFRII
- apolipoprotein H a plasma glycoprotein implicated in a variety of physiological pathways including blood coagulation, haemostasis and the production of anti-phospholipid antibodies
- sCD40 and haptoglobin are also part of a group of 8 plasma markers capable of distinguishing slow decliners from controls with about 94% accuracy.
- This signature also includes 3 other cytokines/chemokines: GM-CSF, a regulator of neutrophil survival and activity found at elevated levels in bronchoalveolar fluid from stable and exacerbated COPD patients (Balbi et al., 1997, Eur. Respir. J. 10:846-850); MIPl- ⁇ , a chemoattractant for neutrophils and monocytes (Barnes, 2004, supra); and, TNF- ⁇ , a cytokine implicated in COPD (Barnes, 2004, supra) and found at higher levels in the blood of stable COPD patients (Gan et al., 2994, supra).
- GM-CSF a regulator of neutrophil survival and activity found at elevated levels in bronchoalveolar fluid from stable and exacerbated COPD patients
- MIPl- ⁇ a chemoattractant for neutrophils and monocytes
- TNF- ⁇ a cytokine implicated in COPD (Barnes, 2004, supra) and found at higher levels in the blood of stable
- the COPD slow decliner signature is completed by apolipoprotein CIII, a modulator of lipoprotein metabolism (Shachter, 2001, Curr. Opin. Lipidol 12:297-304); IgA 5 a subtype of circulating immunoglobulin that has been previously associated with anti -phospholipid antibodies (Staub et al., 2006, Autoimmun. Rev. 6:104-106); and, soluble tissue factor (sTF), a coagulating factor that has been found to be elevated in patients with anti-phospholipid syndrome characterized by the presence of anti-phospholipid antibodies and thromboembolic complications.
- sTF soluble tissue factor
- the rapid versus slow COPD decliner signature is composed of 9 plasma markers. Three of those markers, ⁇ l -antitrypsin, C-reactive Protein (CRP) and fibrinogen, have been extensively reported to be modulated in COPD patients (Gan et al., 2004, supra; Ranes and Stoller, 2005, Semin. Respir. Crit. Care Med. 26:154-166). Three other markers are cytokines/chemokines: GM-CSF; IL-4; and, macrophage-derived chemokine (MDC). GM-CSF is also included in the slow decliner signature and has been discussed above. IL-4 is the most significantly increased marker in rapid decliners compared to slow decliners and controls in the univariate analysis.
- MDC (CCL22) has been shown to be up-regulated by IL-4 and plays an important role in the recruitment of Th2 cells to inflammatory sites (Gan et al., 2004, supra; Yamashita and Kuroda, 2002, Crit. Rev. Immunol 22: 105-114).
- the signature is completed with sTF, sTNFRII and sVCAM-1 described above. Attempts to identify a single signature that would distinguish controls, slow and rapid COPD decliners did not yield accuracies above 82%.
- the need for distinct signatures to accurately distinguish rapid or slow COPD decliners from controls and the fact that a signature can be found that accurately distinguishes rapid from slow COPD decliners suggest the existence of fundamental biochemical differences linked to the rate of lung function decline in COPD. To our knowledge, this is the first study describing plasma markers differentiating slow from rapid declining form of COPD.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medical Informatics (AREA)
- Data Mining & Analysis (AREA)
- Bioethics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Databases & Information Systems (AREA)
- Epidemiology (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Public Health (AREA)
- Software Systems (AREA)
- Biophysics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Other Investigation Or Analysis Of Materials By Electrical Means (AREA)
Abstract
Description
Claims
Priority Applications (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009801138637A CN102089654A (en) | 2008-03-10 | 2009-02-27 | COPD biomarker signatures |
EP09718959.1A EP2269060B1 (en) | 2008-03-10 | 2009-02-27 | Copd biomarker signatures |
CA2718251A CA2718251A1 (en) | 2008-03-10 | 2009-02-27 | Copd biomarker signatures |
JP2010550748A JP5574990B2 (en) | 2008-03-10 | 2009-02-27 | COPD biomarker signature |
US12/921,997 US20110160070A1 (en) | 2008-03-10 | 2009-02-27 | Copd biomarker signatures |
US15/650,702 US20180004895A1 (en) | 2008-03-10 | 2017-07-14 | COPD Biomarker Signatures |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US6877208P | 2008-03-10 | 2008-03-10 | |
US61/068,772 | 2008-03-10 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/921,997 A-371-Of-International US20110160070A1 (en) | 2008-03-10 | 2009-02-27 | Copd biomarker signatures |
US15/650,702 Continuation US20180004895A1 (en) | 2008-03-10 | 2017-07-14 | COPD Biomarker Signatures |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2009114292A1 true WO2009114292A1 (en) | 2009-09-17 |
Family
ID=41065531
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2009/035376 WO2009114292A1 (en) | 2008-03-10 | 2009-02-27 | Copd biomarker signatures |
Country Status (6)
Country | Link |
---|---|
US (2) | US20110160070A1 (en) |
EP (2) | EP3156925A3 (en) |
JP (1) | JP5574990B2 (en) |
CN (1) | CN102089654A (en) |
CA (1) | CA2718251A1 (en) |
WO (1) | WO2009114292A1 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011100792A1 (en) * | 2010-02-16 | 2011-08-25 | Crc For Asthma And Airways Ltd | Protein biomarkers for obstructive airways diseases |
JP2013524202A (en) * | 2010-03-29 | 2013-06-17 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | COPD exacerbation detection using sputum analysis |
US8812249B2 (en) | 2009-06-25 | 2014-08-19 | University Hospital Of North Staffordshire Nhs Trust | Analyzer apparatus and methods for lung disease |
US10471086B2 (en) | 2011-03-07 | 2019-11-12 | Temple University—Of the Commonwealth System of Higher Education | Treatment of chronic obstructive pulmonary disease (COPD) |
EP3610260A4 (en) * | 2017-04-12 | 2021-01-06 | ProterixBio, Inc. | Biomarker combinations for monitoring chronic obstructive pulmonary disease and/or associated mechanisms |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013190092A1 (en) | 2012-06-21 | 2013-12-27 | Philip Morris Products S.A. | Gene signatures for copd diagnosis |
US20140012637A1 (en) * | 2012-07-06 | 2014-01-09 | Xerox Corporation | Traffic delay detection by mining ticket validation transactions |
US9103837B2 (en) | 2012-11-07 | 2015-08-11 | Somalogic, Inc. | Chronic obstructive pulmonary disease (COPD) biomarkers and uses thereof |
WO2017114570A1 (en) * | 2015-12-30 | 2017-07-06 | Institute Of Physics Belgrade, University Of Belgrade | Security tag containing a pattern of biological particles |
CN109680060A (en) * | 2017-10-17 | 2019-04-26 | 华东师范大学 | Methylate marker and its application in diagnosing tumor, classification |
CN112204399B (en) * | 2018-06-01 | 2024-07-26 | 高露洁-棕榄公司 | Methods, devices and systems for quantifying biomarkers |
SG11202103884UA (en) * | 2018-10-30 | 2021-05-28 | Somalogic Inc | Methods for sample quality assessment |
US10354205B1 (en) | 2018-11-29 | 2019-07-16 | Capital One Services, Llc | Machine learning system and apparatus for sampling labelled data |
CN110824166A (en) * | 2019-11-20 | 2020-02-21 | 淄博市中心医院 | Convenient tumor marker molecule detection method |
WO2022020755A2 (en) * | 2020-07-24 | 2022-01-27 | The Regents Of The University Of California | Biomarkers and methods of selecting and using the same |
CN112725435A (en) * | 2021-01-28 | 2021-04-30 | 中国医学科学院北京协和医院 | Susceptibility gene of chronic obstructive pulmonary acute exacerbation and application thereof in predicting susceptibility to chronic obstructive pulmonary acute exacerbation |
CN113951867B (en) * | 2021-10-19 | 2023-10-17 | 天津大学 | Chronic obstructive pulmonary disease recognition device based on oral-nasal airflow signal |
CN114708918A (en) * | 2022-04-20 | 2022-07-05 | 中国科学院深圳先进技术研究院 | Identification method and system for rare macrophage subpopulations and disease markers in idiopathic pulmonary fibrosis |
Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5242828A (en) | 1988-11-10 | 1993-09-07 | Pharmacia Biosensor Ab | Sensing surfaces capable of selective biomolecular interactions, to be used in biosensor systems |
WO2000056934A1 (en) | 1999-03-24 | 2000-09-28 | Packard Bioscience Company | Continuous porous matrix arrays |
US6225047B1 (en) | 1997-06-20 | 2001-05-01 | Ciphergen Biosystems, Inc. | Use of retentate chromatography to generate difference maps |
US6329209B1 (en) | 1998-07-14 | 2001-12-11 | Zyomyx, Incorporated | Arrays of protein-capture agents and methods of use thereof |
US6537749B2 (en) | 1998-04-03 | 2003-03-25 | Phylos, Inc. | Addressable protein arrays |
WO2003048768A2 (en) | 2001-12-05 | 2003-06-12 | Sense Proteomic Limited | Protein arrays for allelic variants and uses thereof |
WO2004070058A1 (en) | 2003-02-03 | 2004-08-19 | Bayer Healthcare Ag | Methods and compositions for the prediction, diagnosis, prognosis, prevention and treatment of copd |
US20060211026A1 (en) | 2005-03-17 | 2006-09-21 | Roche Palo Alto Llc | Methods for assessing emphysema |
WO2006118522A1 (en) | 2005-04-29 | 2006-11-09 | Astrazeneca Ab | Peptides as biomarkers of copd |
US20070172902A1 (en) | 2005-06-22 | 2007-07-26 | The Johns Hopkins University, a non-profit organization | Biomarker for ovarian cancer |
WO2007084485A2 (en) | 2006-01-13 | 2007-07-26 | Battelle Memorial Institute | Markers for assessing copd-related diseases |
US20070178504A1 (en) * | 2005-12-22 | 2007-08-02 | Tracey Colpitts | Methods and marker combinations for screening for predisposition to lung cancer |
US20070269804A1 (en) * | 2004-06-19 | 2007-11-22 | Chondrogene, Inc. | Computer system and methods for constructing biological classifiers and uses thereof |
WO2008003066A2 (en) | 2006-06-28 | 2008-01-03 | Lineagen, Inc. | Apparatus, compositions, and methods for assessment of chronic obstructive pulmonary disease progression among rapid and slow decline conditions |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1587955A4 (en) * | 2002-12-24 | 2007-03-14 | Biosite Inc | Markers for differential diagnosis and methods of use thereof |
AU2004251181B2 (en) * | 2003-06-27 | 2009-10-01 | Axikin Pharmaceuticals, Inc. | Regulation of kinase, 'regulated in COPD kinase' (RC kinase) |
JP2007527239A (en) * | 2004-02-27 | 2007-09-27 | バイオシーク インコーポレイティッド | Biological data set profiling method for asthma and atopy |
US20070099239A1 (en) * | 2005-06-24 | 2007-05-03 | Raymond Tabibiazar | Methods and compositions for diagnosis and monitoring of atherosclerotic cardiovascular disease |
EP2005168A4 (en) * | 2006-03-09 | 2009-05-20 | Biosite Inc | Methods and compositions for the diagnosis of diseases of the aorta |
-
2009
- 2009-02-27 WO PCT/US2009/035376 patent/WO2009114292A1/en active Application Filing
- 2009-02-27 CA CA2718251A patent/CA2718251A1/en not_active Abandoned
- 2009-02-27 EP EP16184336.2A patent/EP3156925A3/en not_active Withdrawn
- 2009-02-27 JP JP2010550748A patent/JP5574990B2/en not_active Expired - Fee Related
- 2009-02-27 US US12/921,997 patent/US20110160070A1/en not_active Abandoned
- 2009-02-27 CN CN2009801138637A patent/CN102089654A/en active Pending
- 2009-02-27 EP EP09718959.1A patent/EP2269060B1/en not_active Not-in-force
-
2017
- 2017-07-14 US US15/650,702 patent/US20180004895A1/en not_active Abandoned
Patent Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5242828A (en) | 1988-11-10 | 1993-09-07 | Pharmacia Biosensor Ab | Sensing surfaces capable of selective biomolecular interactions, to be used in biosensor systems |
US6225047B1 (en) | 1997-06-20 | 2001-05-01 | Ciphergen Biosystems, Inc. | Use of retentate chromatography to generate difference maps |
US6537749B2 (en) | 1998-04-03 | 2003-03-25 | Phylos, Inc. | Addressable protein arrays |
US6329209B1 (en) | 1998-07-14 | 2001-12-11 | Zyomyx, Incorporated | Arrays of protein-capture agents and methods of use thereof |
WO2000056934A1 (en) | 1999-03-24 | 2000-09-28 | Packard Bioscience Company | Continuous porous matrix arrays |
WO2003048768A2 (en) | 2001-12-05 | 2003-06-12 | Sense Proteomic Limited | Protein arrays for allelic variants and uses thereof |
WO2004070058A1 (en) | 2003-02-03 | 2004-08-19 | Bayer Healthcare Ag | Methods and compositions for the prediction, diagnosis, prognosis, prevention and treatment of copd |
US20070269804A1 (en) * | 2004-06-19 | 2007-11-22 | Chondrogene, Inc. | Computer system and methods for constructing biological classifiers and uses thereof |
US20060211026A1 (en) | 2005-03-17 | 2006-09-21 | Roche Palo Alto Llc | Methods for assessing emphysema |
WO2006118522A1 (en) | 2005-04-29 | 2006-11-09 | Astrazeneca Ab | Peptides as biomarkers of copd |
US20070172902A1 (en) | 2005-06-22 | 2007-07-26 | The Johns Hopkins University, a non-profit organization | Biomarker for ovarian cancer |
US20070178504A1 (en) * | 2005-12-22 | 2007-08-02 | Tracey Colpitts | Methods and marker combinations for screening for predisposition to lung cancer |
WO2007084485A2 (en) | 2006-01-13 | 2007-07-26 | Battelle Memorial Institute | Markers for assessing copd-related diseases |
WO2008003066A2 (en) | 2006-06-28 | 2008-01-03 | Lineagen, Inc. | Apparatus, compositions, and methods for assessment of chronic obstructive pulmonary disease progression among rapid and slow decline conditions |
Non-Patent Citations (61)
Title |
---|
"R: A language and environment for statistical computing", 2006, R FOUNDATION FOR STATISTICAL COMPUTING |
ALDONYTE ET AL., COPD, vol. 1, 2004, pages 155 - 164 |
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 410 |
ANDREA PETERS; TORSTEN HOTHORN, IMPROVED PREDICTORS, 2004 |
AUSUBEL, F M ET AL.: "Current Protocols in Molecular Biology", vol. 4, 1996, JOHN WILEY & SONS, INC. |
BALBI ET AL., EUR. RESPIR. J., vol. 10, 1997, pages 846 - 850 |
BARCELO ET AL., CLIN. EXP. IMMUNOL., vol. 145, 2006, pages 474 - 479 |
BARNES, PHARMACOL. REV., vol. 56, 2004, pages 515 - 548 |
BENJAMINI; HOCKBERG; BENJAMINI; HOCKBERG, J. BEHAV. EDUC. STATIST., vol. 25, 2000, pages 60 - 83 |
BOX; COX, J. ROYAL STAT. SOC., SERIES B, vol. 26, 1964, pages 211 - 246 |
BREIMAN ET AL., MACH. LEARN., vol. 45, 2001, pages 5 - 32 |
CARPENTIER ET AL., CURR. MED. CHEM., vol. 11, 2004, pages 2205 - 2212 |
DAHL ET AL.: "Elevated plasma fibrinogen associated with reduced pulmonary function andincreased risk of chronic obstructive pulmonary disease.", AM JOUR RESP CRIT CARE MED, vol. 164, no. 6, 15 September 2001 (2001-09-15), pages 1008 - 1111, XP008142959 * |
D'AMBROSIO ET AL.: "Chemokines and their receptors guiding T lymphocyte recruitment in lung inflammation.", AM JOUR RESPIR CRIT CARE MED, vol. 164, no. 7, 2001, pages 1266 - 1275, XP008142957 * |
DE BOER ET AL.: "Monocyte chemoattractant protein 1, interleukin 8, and chronic airways inflammation in COPD.", JOUR PATHOL, vol. 190, no. 5, April 2000 (2000-04-01), pages 619 - 626, XP008142931 * |
DUDA: "Pattern Classification", 2001, JOHN WILEY & SONS, INC |
EFRON ET AL., ANNALS OF STATISTICS, vol. 32, 2004, pages 407 - 451 |
FABBRI ET AL., AM. J. RESPIR. CRIT. CARE MED., vol. 173, 2006, pages 1056 - 1065 |
FOSTER, J. ALLERGY CLIN. IMMUNOL., vol. 98, 1996, pages S270 - S277 |
GAN ET AL., THORAX, vol. 59, 2004, pages 574 - 580 |
GAP; BESTFIT; FASTA; TFASTA: "Wisconsin Genetics Software Package", GENETICS COMPUTER GROUP |
HASTIE ET AL.: "The Elements of Statistical Learning", 2001, SPRINGER |
HASTIE: "The Elements of Statistical Learning", 2001, SPRINGER |
HEFFNER, RESPIR. CARE, vol. 47, 2002, pages 586 - 607 |
HUANG ET AL., PROC. NATL. ACAD. SCI. USA, vol. 101, 2004, pages 10529 - 10534 |
JANHZ-ROZYK ET AL., POL. MERKUR. LEKARSKI., vol. 9, 2000, pages 649 - 652 |
JEFFERY, PETER K.: "Chronic Obstructive Pulmonary Disease: Pathogenesis to Treatment: Novartis Foundation Symposium", vol. 234, 2001, CHICHESTER: WILEY, article "Lymphocytes, Chronic Bronchitis and Chronic Obstructive Pulmonary Disease", pages: 149 - 168 |
KOIZUMI ET AL.: "Elevation of serum soluble vascular cell adhesion molecule-1 (sVCAM-1) levels in bronchial asthma.", CLIN EXP IMMUNOL, vol. 101, no. 3, September 1995 (1995-09-01), pages 468 - 473, XP008142958 * |
KOMURA ET AL., J. RHEUMATOL., vol. 34, 2007, pages 353 - 358 |
MCNEIL ET AL., PROC. NATL. ACAD. SCI. USA, vol. 87, 1990, pages 4120 - 4124 |
MOCALI ET AL., EXP. GERONTOL., vol. 39, 2004, pages 1555 - 1561 |
NEEDLEMAN; WUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443 |
OWENS, AM. J. MED., vol. 91, 1991, pages 37S - 40S |
PEARSON; LIPMAN, PROC. NATL. ACAD. SCI. USA, vol. 85, 1988, pages 2444 |
PILETTE ET AL.: "Mucosal immunity in asthma and chronic obstructive pulmonary disease: a role for immunoglobulin A?", PROC AM THORAC SOC, vol. 1, no. 2, April 2004 (2004-04-01), pages 125 - 135, XP008142927 * |
PINTO-PLATA ET AL., THORAX, vol. 62, 2007, pages 595 - 601 |
PONS ET AL.: "Phenotypic characterisation of alveolar macrophages and peripheral blood monocytes in COPD.", EUR RESPIR JOUR, vol. 25, no. 4, April 2005 (2005-04-01), pages 647 - 652, XP008142937 * |
RANES; STOLLER, SEMIN. RESPIR. CRIT. CARE MED., vol. 26, 2005, pages 154 - 166 |
RENNARD, CHEST, vol. 113, 1998, pages 235S - 241 S |
REYNOLDS ET AL.: "Cigarette Smoke-Induced Egr-1 Upregulates Proinflammatory Cytokines in Pulmonary Epithelial Cells.", AMER JOUR RESP CELL MOL BIOL, vol. 35, no. 3, September 2006 (2006-09-01), pages 314 - 319, XP008142928 * |
RUCZINSKI ET AL., J. COMPUT. GRAPH. STAT., vol. 12, 2003, pages 475 - 512 |
SCHMILOVITZ-WEISS ET AL., APOPTOSIS, vol. 9, 2004, pages 205 - 210 |
SCHWABE ET AL., CLIN. EXP. IMMUNOL., vol. 117, 1999, pages 153 - 158 |
See also references of EP2269060A4 |
SHACHTER, CURR. OPIN. LIPIDOL., vol. 12, 2001, pages 297 - 304 |
SINGER: "Recursive Partitioning in the Health Sciences", 1999, SPRINGER |
SMITH; WATERMAN, ADV. APPL. MATH., vol. 2, 1981, pages 482 |
STAUB ET AL., AUTOIMMUN. REV., vol. 6, 2006, pages 104 - 106 |
STREITBERG; ROHMEL: "Exact Distribution for Permutations and Rank Test: An Introduction to Some Recently Published Algorithms", STATISTICAL SOFTWARE NEWSLETTER, vol. 12, 1986, pages 10 - 17 |
THERIEN ET AL.: "Adenovirus IL-13-induced airway disease in mice: a corticosteroid-resistant model of severe asthma.", AM JOUR RESPCELL MOL BIO., vol. 39, no. 1, 7 February 2008 (2008-02-07), pages 26 - 35, XP008142929 * |
TIBSHIRANI ET AL., PROC. NATL. ACAD. SCI. USA, vol. 99, 2002, pages 6567 - 72 |
TORRES ET AL.: "C-reactive protein levels and clinically important predictive outcomes in stable COPD patients.", EUR RESP JOUR., vol. 27, no. 5, May 2006 (2006-05-01), pages 902 - 907, XP008142960 * |
TURNBULL, CLASSIFICATION TREES WITH SUBSET ANALYSIS SELECTION BY THE LASSO, 2005 |
TUSHER ET AL., PROC. NATL. ACAD. SCI. USA, vol. 98, 2001, pages 5116 - 21 |
VENABLES, W.N.; RIPLEY, B.D.: "Modern Applied Statistics", 2002, SPRINGER |
VENABLES; RIPLEY: "Modern Applied Statistics with s-plus", 1997, SPRINGER |
VERNOOY ET AL.: "Local and systemic inflammation in patients with chronic obstructive pulmonary disease: soluble tumor necrosis factor receptors are increased in sputum.", AM J RESPIR CRIT CARE MED, vol. 166, no. 9, 1 November 2002 (2002-11-01), pages 1218 - 1224, XP002467301 * |
WOUTERS, PROC. AM. THORAC. SOC., vol. 2, 2005, pages 26 - 33 |
YAMASHITA; KURODA, CRIT. REV. IMMUNOL., vol. 22, 2002, pages 105 - 114 |
ZHANG ET AL., J. TONGJI MED. UNIV., vol. 19, 1999, pages 15 - 19 |
ZVI; LEVY, CLIN. LAB., vol. 52, 2006, pages 29 - 35 |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8812249B2 (en) | 2009-06-25 | 2014-08-19 | University Hospital Of North Staffordshire Nhs Trust | Analyzer apparatus and methods for lung disease |
WO2011100792A1 (en) * | 2010-02-16 | 2011-08-25 | Crc For Asthma And Airways Ltd | Protein biomarkers for obstructive airways diseases |
JP2013524202A (en) * | 2010-03-29 | 2013-06-17 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | COPD exacerbation detection using sputum analysis |
US10471086B2 (en) | 2011-03-07 | 2019-11-12 | Temple University—Of the Commonwealth System of Higher Education | Treatment of chronic obstructive pulmonary disease (COPD) |
EP3610260A4 (en) * | 2017-04-12 | 2021-01-06 | ProterixBio, Inc. | Biomarker combinations for monitoring chronic obstructive pulmonary disease and/or associated mechanisms |
Also Published As
Publication number | Publication date |
---|---|
JP5574990B2 (en) | 2014-08-20 |
EP3156925A3 (en) | 2017-06-21 |
CA2718251A1 (en) | 2009-09-17 |
US20180004895A1 (en) | 2018-01-04 |
EP2269060A1 (en) | 2011-01-05 |
EP2269060B1 (en) | 2016-08-17 |
US20110160070A1 (en) | 2011-06-30 |
CN102089654A (en) | 2011-06-08 |
EP3156925A2 (en) | 2017-04-19 |
JP2011521203A (en) | 2011-07-21 |
EP2269060A4 (en) | 2011-05-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20180004895A1 (en) | COPD Biomarker Signatures | |
US20080300797A1 (en) | Two biomarkers for diagnosis and monitoring of atherosclerotic cardiovascular disease | |
US20070099239A1 (en) | Methods and compositions for diagnosis and monitoring of atherosclerotic cardiovascular disease | |
JP2023178345A (en) | Method and system for determining autism spectrum disorder risk | |
US9423403B2 (en) | Chronic obstructive pulmonary disease (COPD) biomarkers and uses thereof | |
JP2014514572A (en) | Identification and diagnosis method of lung disease using classification system and kit | |
WO2011072177A2 (en) | Biomarker assay for diagnosis and classification of cardiovascular disease | |
US20140162370A1 (en) | Urine biomarkers for necrotizing enterocolitis and sepsis | |
Jiang et al. | RNA sequencing data from neutrophils of patients with cystic fibrosis reveals potential for developing biomarkers for pulmonary exacerbations | |
Conway et al. | Biomarker panels in critical care | |
WO2015164616A1 (en) | Biomarkers for detection of tuberculosis | |
Faro et al. | Polygenic risk scores and risk stratification in deep vein thrombosis | |
WO2016123058A1 (en) | Biomarkers for detection of tuberculosis risk | |
US20180356419A1 (en) | Biomarkers for detection of tuberculosis risk | |
Bonaroti et al. | Prognostic biomarkers to predict outcomes in trauma | |
Yang | Blood-based biomarkers of asthma | |
US20230048910A1 (en) | Methods of Determining Impaired Glucose Tolerance | |
WO2024015485A1 (en) | Methods of assessing dementia risk | |
Bhuiyan et al. | Role of Biomarkers in the Stratification of COVID-19 Disease Severity–A Review | |
Dong | Predictive biomarkers of pulmonary exacerbations in cystic fibrosis | |
EP4413372A1 (en) | Lung cancer prediction and uses thereof | |
WO2023278502A1 (en) | Renal health determination and uses thereof | |
JP2024525146A (en) | Prediction of renal failure and uses thereof | |
WO2020117184A1 (en) | Methods and compositions for providing a preeclampsia assessment by proteomics |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200980113863.7 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 09718959 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010550748 Country of ref document: JP Ref document number: 2718251 Country of ref document: CA |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 6616/DELNP/2010 Country of ref document: IN |
|
REEP | Request for entry into the european phase |
Ref document number: 2009718959 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2009718959 Country of ref document: EP |