WO2012041861A3 - Computer-implemented method for analyzing multivariate data - Google Patents
- Publication number
- WO2012041861A3 (application PCT/EP2011/066787)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- multivariate data
- subset
- computer
- implemented method
- projection score
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/30—Unsupervised data analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/211—Selection of the most significant subset of features
- G06F18/2115—Selection of the most significant subset of features by evaluating different subsets according to an optimisation criterion, e.g. class separability, forward selection or backward elimination
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Mathematical Physics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Analysis (AREA)
- Computational Mathematics (AREA)
- Databases & Information Systems (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Medical Informatics (AREA)
- Algebra (AREA)
- Epidemiology (AREA)
- Public Health (AREA)
- Biophysics (AREA)
- Biotechnology (AREA)
- Bioethics (AREA)
- General Health & Medical Sciences (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Operations Research (AREA)
- Probability & Statistics with Applications (AREA)
- Complex Calculations (AREA)
- Debugging And Monitoring (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
Abstract
A computer-implemented method for analyzing multivariate data comprising a plurality of samples of each of a plurality of measurement variables is disclosed. The method comprises, for a first subset φA(X) of the multivariate data X, determining (110) a first projection score related to the first subset. The method further comprises, for a second subset φB(X) of the multivariate data X, determining (120) a second projection score related to the second subset. The method also comprises comparing (130) the first and second projection scores to determine which of the two subsets provides the more informative representation of the multivariate data, defined as the subset with the higher related projection score. A definition of the projection score is also provided.
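The three steps in the abstract (110, 120, 130) can be illustrated with a minimal Python sketch. The abstract does not reproduce the definition of the projection score, so the score used here is an assumption: the square root of the variance fraction captured by the leading principal components, minus the mean of the same quantity after independently permuting each variable. This is in the spirit of the Fontes et al. paper cited in this record, but it is not taken from the patent text. The function names, the samples-by-variables layout, and all parameter defaults are hypothetical.

```python
import numpy as np


def explained_fraction(X, n_components):
    """Fraction of total variance captured by the first n_components principal components."""
    Xc = X - X.mean(axis=0, keepdims=True)      # center each variable (column)
    s = np.linalg.svd(Xc, compute_uv=False)     # singular values, in descending order
    total = np.sum(s ** 2)
    return float(np.sum(s[:n_components] ** 2) / total) if total > 0 else 0.0


def projection_score(X, n_components=2, n_permutations=50, seed=0):
    """Assumed score: observed sqrt(explained fraction) minus its mean under permutation."""
    rng = np.random.default_rng(seed)
    observed = np.sqrt(explained_fraction(X, n_components))
    null = []
    for _ in range(n_permutations):
        # Permute each variable independently across samples to break correlations.
        Xp = np.column_stack([rng.permutation(X[:, j]) for j in range(X.shape[1])])
        null.append(np.sqrt(explained_fraction(Xp, n_components)))
    return float(observed - np.mean(null))


def more_informative_subset(X, cols_a, cols_b, **kwargs):
    """Score two variable subsets and report which one has the higher score."""
    score_a = projection_score(X[:, cols_a], **kwargs)   # step 110
    score_b = projection_score(X[:, cols_b], **kwargs)   # step 120
    label = "A" if score_a >= score_b else "B"           # step 130
    return label, score_a, score_b


if __name__ == "__main__":
    # Synthetic data: rows are samples, columns are measurement variables.
    rng = np.random.default_rng(1)
    latent = rng.normal(size=(40, 1))
    structured = latent + 0.3 * rng.normal(size=(40, 5))  # variables sharing one factor
    noise = rng.normal(size=(40, 5))                       # unrelated variables
    X = np.hstack([structured, noise])
    print(more_informative_subset(X, [0, 1, 2, 3, 4], [5, 6, 7, 8, 9], n_components=1))
```

In this sketch a subset of variables driven by a shared factor should score well above a subset of unrelated noise variables, because permutation destroys the shared structure in the former but changes little in the latter. Here the subsets are selections of variables; the abstract's φA(X) and φB(X) may be more general.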
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/876,182 US20130304783A1 (en) | 2010-09-27 | 2011-09-27 | Computer-implemented method for analyzing multivariate data |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP10180086.0 | 2010-09-27 | | |
EP10180086A EP2434411A1 (en) | 2010-09-27 | 2010-09-27 | Computer-implemented method for analyzing multivariate data |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2012041861A2 (en) | 2012-04-05 |
WO2012041861A3 (en) | 2012-08-23 |
Family
ID=44201945
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/EP2011/066787 WO2012041861A2 (en) | 2010-09-27 | 2011-09-27 | Computer-implemented method for analyzing multivariate data |
Country Status (3)
Country | Link |
---|---|
US (1) | US20130304783A1 (en) |
EP (1) | EP2434411A1 (en) |
WO (1) | WO2012041861A2 (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9785890B2 (en) * | 2012-08-10 | 2017-10-10 | Fair Isaac Corporation | Data-driven product grouping |
CN103700065B (en) * | 2013-12-03 | 2016-07-06 | 杭州电子科技大学 | Structure-sparsity propagation image inpainting method based on feature classification learning |
US20160149776A1 (en) * | 2014-11-24 | 2016-05-26 | Cisco Technology, Inc. | Anomaly detection in protocol processes |
US9792254B2 (en) * | 2015-09-25 | 2017-10-17 | International Business Machines Corporation | Computing intersection cardinality |
CN106296606B (en) * | 2016-08-04 | 2019-07-23 | 杭州电子科技大学 | Classified sparse representation image inpainting method with edge fitting |
EP3559822A4 (en) * | 2016-12-22 | 2020-08-19 | Liveramp, Inc. | Mixed data fingerprinting with principal components analysis |
US10635939B2 (en) | 2018-07-06 | 2020-04-28 | Capital One Services, Llc | System, method, and computer-accessible medium for evaluating multi-dimensional synthetic data using integrated variants analysis |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7587285B2 (en) * | 2007-08-31 | 2009-09-08 | Life Technologies Corporation | Method for identifying correlated variables |
US20100145624A1 (en) * | 2008-12-04 | 2010-06-10 | Syngenta Participations Ag | Statistical validation of candidate genes |
2010
- 2010-09-27: EP application EP10180086A, published as EP2434411A1; status: ceased (not active)
2011
- 2011-09-27: US application US13/876,182, published as US20130304783A1; status: abandoned (not active)
- 2011-09-27: WO application PCT/EP2011/066787, published as WO2012041861A2; status: application filing (active)
Non-Patent Citations (2)
Title |
---|
MAGNUS FONTES ET AL: "The projection score - an evaluation criterion for variable subset selection in PCA visualization", BMC BIOINFORMATICS, vol. 12, no. 1, 28 July 2011 (2011-07-28), pages 307, XP055030746, ISSN: 1471-2105, DOI: 10.1186/1471-2105-12-307 * |
MAGNUS FONTES: "Statistical and knowledge supported visualization of multivariate data", 31 August 2010 (2010-08-31), XP055002519, Retrieved from the Internet <URL:http://arxiv.org/PS_cache/arxiv/pdf/1008/1008.5374v1.pdf> [retrieved on 20110711] * |
Also Published As
Publication number | Publication date |
---|---|
WO2012041861A2 (en) | 2012-04-05 |
EP2434411A1 (en) | 2012-03-28 |
US20130304783A1 (en) | 2013-11-14 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
WO2012041861A3 (en) | Computer-implemented method for analyzing multivariate data | |
WO2012012664A3 (en) | Image reporting method | |
TW200636411A (en) | Automated throughput control system and method of operating the same | |
WO2009073664A3 (en) | Rating raters | |
WO2006133125A3 (en) | Dynamic model generation methods and apparatus | |
WO2011035298A3 (en) | Methods and apparatus to perform choice modeling with substitutability data | |
WO2009126553A3 (en) | Dynamic integration of disparate health-related processes and data | |
WO2012037578A3 (en) | Sales prediction and recommendation system | |
WO2013098663A3 (en) | Cell clustering and aperture selection | |
WO2007059088A3 (en) | Interferometer and method for measuring characteristics of optically unresolved surface features | |
WO2012145616A3 (en) | Predictive modeling | |
WO2007002729A3 (en) | Method and system for predicting consumer behavior | |
WO2009052514A3 (en) | Methods of identifying environmentally friendly businesses or individuals | |
WO2012177817A3 (en) | Systems and methods for identifying a contributor's str genotype based on a dna sample having multiple contributors | |
WO2011156799A3 (en) | Detecting state estimation network model data errors | |
WO2014070306A3 (en) | System and method for applying a business rule management system to a customer relationship management system | |
EA201290770A1 (en) | METHOD OF MAINTENANCE OF THE PIPELINE | |
WO2014105745A3 (en) | Seismic data analysis | |
WO2013131025A3 (en) | Product cycle analysis using social media data | |
WO2012006148A3 (en) | Genotype simulation estimates mis-classification rate in genotyping | |
WO2009132126A4 (en) | Method for predicting risk of metastasis | |
WO2010107581A3 (en) | System for cross-integration of consumer loyalty programs and method thereof | |
WO2012047214A3 (en) | Visual display of semantic information | |
BR112014012003A2 (en) | computer readable quality control method, method and medium for use with consumer goods, users and biological / environmental diagnostic test devices | |
WO2008032176A3 (en) | A method for generating contact groups |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 11761086; Country of ref document: EP; Kind code of ref document: A2 |
| NENP | Non-entry into the national phase | Ref country code: DE |
| WWE | WIPO information: entry into national phase | Ref document number: 13876182; Country of ref document: US |
| 122 | EP: PCT application non-entry in European phase | Ref document number: 11761086; Country of ref document: EP; Kind code of ref document: A2 |