WO2012041861A3 - Computer-implemented method for analyzing multivariate data - Google Patents

Computer-implemented method for analyzing multivariate data Download PDF

Info

Publication number
WO2012041861A3
WO2012041861A3 PCT/EP2011/066787 EP2011066787W WO2012041861A3 WO 2012041861 A3 WO2012041861 A3 WO 2012041861A3 EP 2011066787 W EP2011066787 W EP 2011066787W WO 2012041861 A3 WO2012041861 A3 WO 2012041861A3
Authority
WO
WIPO (PCT)
Prior art keywords
multivariate data
subset
computer
implemented method
projection score
Prior art date
Application number
PCT/EP2011/066787
Other languages
French (fr)
Other versions
WO2012041861A2 (en
Inventor
Magnus Fontes
Original Assignee
Qlucore Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qlucore Ab filed Critical Qlucore Ab
Priority to US13/876,182 priority Critical patent/US20130304783A1/en
Publication of WO2012041861A2 publication Critical patent/WO2012041861A2/en
Publication of WO2012041861A3 publication Critical patent/WO2012041861A3/en

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/30Unsupervised data analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211Selection of the most significant subset of features
    • G06F18/2115Selection of the most significant subset of features by evaluating different subsets according to an optimisation criterion, e.g. class separability, forward selection or backward elimination
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Algebra (AREA)
  • Epidemiology (AREA)
  • Public Health (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • Bioethics (AREA)
  • General Health & Medical Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Operations Research (AREA)
  • Probability & Statistics with Applications (AREA)
  • Complex Calculations (AREA)
  • Debugging And Monitoring (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)

Abstract

A computer-implemented method for analyzing multivariate data comprising a plurality of samples of each of a plurality of measurement variables is disclosed. The method comprises, for a first subset φΑ (X) of the multivariate data X, determining (110) a first projection score related to the first subset. Furthermore, the method comprises, for a second subset φB (X) of the multivariate data X, determining (120) a second projection score related to the second subset. Moreover, the method comprises, comparing (130) the first and the second projection score for determining which one of the first and the second subset provides the most informative representation of the multivariate data, which is defined as the one of said subsets having the highest related projection score. A definition of the projection score is also provided.
PCT/EP2011/066787 2010-09-27 2011-09-27 Computer-implemented method for analyzing multivariate data WO2012041861A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/876,182 US20130304783A1 (en) 2010-09-27 2011-09-27 Computer-implemented method for analyzing multivariate data

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP10180086.0 2010-09-27
EP10180086A EP2434411A1 (en) 2010-09-27 2010-09-27 Computer-implemented method for analyzing multivariate data

Publications (2)

Publication Number Publication Date
WO2012041861A2 WO2012041861A2 (en) 2012-04-05
WO2012041861A3 true WO2012041861A3 (en) 2012-08-23

Family

ID=44201945

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2011/066787 WO2012041861A2 (en) 2010-09-27 2011-09-27 Computer-implemented method for analyzing multivariate data

Country Status (3)

Country Link
US (1) US20130304783A1 (en)
EP (1) EP2434411A1 (en)
WO (1) WO2012041861A2 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9785890B2 (en) * 2012-08-10 2017-10-10 Fair Isaac Corporation Data-driven product grouping
CN103700065B (en) * 2013-12-03 2016-07-06 杭州电子科技大学 A kind of structure sparse propagation image repair method of tagsort study
US20160149776A1 (en) * 2014-11-24 2016-05-26 Cisco Technology, Inc. Anomaly detection in protocol processes
US9792254B2 (en) * 2015-09-25 2017-10-17 International Business Machines Corporation Computing intersection cardinality
CN106296606B (en) * 2016-08-04 2019-07-23 杭州电子科技大学 A kind of classification rarefaction representation image repair method of edge fitting
EP3559822A4 (en) * 2016-12-22 2020-08-19 Liveramp, Inc. Mixed data fingerprinting with principal components analysis
US10635939B2 (en) 2018-07-06 2020-04-28 Capital One Services, Llc System, method, and computer-accessible medium for evaluating multi-dimensional synthetic data using integrated variants analysis

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7587285B2 (en) * 2007-08-31 2009-09-08 Life Technologies Corporation Method for identifying correlated variables
US20100145624A1 (en) * 2008-12-04 2010-06-10 Syngenta Participations Ag Statistical validation of candidate genes

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MAGNUS FONTES ET AL: "The projection score - an evaluation criterion for variable subset selection in PCA visualization", BMC BIOINFORMATICS, vol. 12, no. 1, 28 July 2011 (2011-07-28), pages 307, XP055030746, ISSN: 1471-2105, DOI: 10.1186/1471-2105-12-307 *
MAGNUS FONTES: "Statistical and knowledge supported visualization of multivariate data", 31 August 2010 (2010-08-31), XP055002519, Retrieved from the Internet <URL:http://arxiv.org/PS_cache/arxiv/pdf/1008/1008.5374v1.pdf> [retrieved on 20110711] *

Also Published As

Publication number Publication date
WO2012041861A2 (en) 2012-04-05
EP2434411A1 (en) 2012-03-28
US20130304783A1 (en) 2013-11-14

Similar Documents

Publication Publication Date Title
WO2012041861A3 (en) Computer-implemented method for analyzing multivariate data
WO2012012664A3 (en) Image reporting method
TW200636411A (en) Automated throughput control system and method of operating the same
WO2009073664A3 (en) Rating raters
WO2006133125A3 (en) Dynamic model generation methods and apparatus
WO2011035298A3 (en) Methods and apparatus to perform choice modeling with substitutability data
WO2009126553A3 (en) Dynamic integration of disparate health-related processes and data
WO2012037578A3 (en) Sales prediction and recommendation system
WO2013098663A3 (en) Cell clustering and aperture selection
WO2007059088A3 (en) Interferometer and method for measuring characteristics of optically unresolved surface features
WO2012145616A3 (en) Predictive modeling
WO2007002729A3 (en) Method and system for predicting consumer behavior
WO2009052514A3 (en) Methods of identifying environmentally friendly businesses or individuals
WO2012177817A3 (en) Systems and methods for identifying a contributor&#39;s str genotype based on a dna sample having multiple contributors
WO2011156799A3 (en) Detecting state estimation network model data errors
WO2014070306A3 (en) System and method for applying a business rule management system to a customer relationship management system
EA201290770A1 (en) METHOD OF MAINTENANCE OF THE PIPELINE
WO2014105745A3 (en) Seismic data analysis
WO2013131025A3 (en) Product cycle analysis using social media data
WO2012006148A3 (en) Genotype simulation estimates mis-classification rate in genotyping
WO2009132126A4 (en) Method for predicting risk of metastasis
WO2010107581A3 (en) System for cross-integration of consumer loyalty programs and method thereof
WO2012047214A3 (en) Visual display of semantic information
BR112014012003A2 (en) computer readable quality control method, method and medium for use with consumer goods, users and biological / environmental diagnostic test devices
WO2008032176A3 (en) A method for generating contact groups

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11761086

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 13876182

Country of ref document: US

122 Ep: pct application non-entry in european phase

Ref document number: 11761086

Country of ref document: EP

Kind code of ref document: A2