EP1449108A1 - Classement de caracteristiques pretraitees pour une machine a vecteur de support - Google Patents

Classement de caracteristiques pretraitees pour une machine a vecteur de support

Info

Publication number
EP1449108A1
EP1449108A1 EP02778747A EP02778747A EP1449108A1 EP 1449108 A1 EP1449108 A1 EP 1449108A1 EP 02778747 A EP02778747 A EP 02778747A EP 02778747 A EP02778747 A EP 02778747A EP 1449108 A1 EP1449108 A1 EP 1449108A1
Authority
EP
European Patent Office
Prior art keywords
data
features
genes
svm
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP02778747A
Other languages
German (de)
English (en)
Other versions
EP1449108A4 (fr
Inventor
Jason Weston
Andre 407 W. 51st Street ELLISSEEF
Bernhard SCHÖLKOPF
Fernando A.T.S.C. Univ. Carolos PEREZ-CRUZ III
Isabelle Guyon
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Health Discovery Corp
Original Assignee
BIOWULF TECHNOLOGIES LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BIOWULF TECHNOLOGIES LLC filed Critical BIOWULF TECHNOLOGIES LLC
Publication of EP1449108A1 publication Critical patent/EP1449108A1/fr
Publication of EP1449108A4 publication Critical patent/EP1449108A4/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211Selection of the most significant subset of features
    • G06F18/2113Selection of the most significant subset of features by ranking or filtering the set of features, e.g. using a measure of variance or of feature cross-correlation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

Des caractéristiques sont prétraitées (204) de manière à réduire les erreurs de classification dans des machines à vecteur de support (SVM) (200) utilisées pour identifier des formes dans des grandes bases de données. Le prétraitement (204) est exécuter de manière obliger les caractéristiques utilisées à former (210) la machine d'apprentissage à vecteur de support. Des données réelles (226) sont collectées et traitées (232) au moyen de la machine à vecteur de support.
EP02778747A 2001-11-07 2002-11-07 Classement de caracteristiques pretraitees pour une machine a vecteur de support Withdrawn EP1449108A4 (fr)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US34756201P 2001-11-07 2001-11-07
US347562P 2001-11-07
PCT/US2002/035576 WO2003040949A1 (fr) 2001-11-07 2002-11-07 Classement de caracteristiques pretraitees pour une machine a vecteur de support

Publications (2)

Publication Number Publication Date
EP1449108A1 true EP1449108A1 (fr) 2004-08-25
EP1449108A4 EP1449108A4 (fr) 2006-11-22

Family

ID=23364249

Family Applications (1)

Application Number Title Priority Date Filing Date
EP02778747A Withdrawn EP1449108A4 (fr) 2001-11-07 2002-11-07 Classement de caracteristiques pretraitees pour une machine a vecteur de support

Country Status (2)

Country Link
EP (1) EP1449108A4 (fr)
WO (1) WO2003040949A1 (fr)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005048185A1 (fr) * 2003-11-17 2005-05-26 Auckland University Of Technology Methode d'inference neuro-floue transductive pour la modelisation personnalisee
NZ572036A (en) 2008-10-15 2010-03-26 Nikola Kirilov Kasabov Data analysis and predictive systems and related methodologies
CA2766914C (fr) 2009-06-30 2019-02-26 Daniel Caraviello Extraction des regles d'association dans les ensembles de donnees de vegetaux et d'animaux et utilisation des fonctionnalites de classement ou prediction
US10535014B2 (en) 2014-03-10 2020-01-14 California Institute Of Technology Alternative training distribution data in machine learning
US9858534B2 (en) 2013-11-22 2018-01-02 California Institute Of Technology Weight generation in machine learning
US9953271B2 (en) 2013-11-22 2018-04-24 California Institute Of Technology Generation of weights in machine learning
US10558935B2 (en) 2013-11-22 2020-02-11 California Institute Of Technology Weight benefit evaluator for training data
US9658987B2 (en) 2014-05-15 2017-05-23 International Business Machines Corporation Regression using M-estimators and polynomial kernel support vector machines and principal component regression
US11139048B2 (en) 2017-07-18 2021-10-05 Analytics For Life Inc. Discovering novel features to use in machine learning techniques, such as machine learning techniques for diagnosing medical conditions
CN111329847A (zh) * 2020-03-19 2020-06-26 上海大学 利用二氢查尔酮类化合物对胰岛素促泌性能进行预报的方法及应用
CN111652393A (zh) * 2020-06-05 2020-09-11 国网信通亿力科技有限责任公司 基于大数据技术的电力设备异常预警方法

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001005935A2 (fr) * 1999-07-16 2001-01-25 Rosetta Inpharmatics, Inc. Conception de sonde iterative et etablissement de profils d'expression detailles avec jeux ordonnes d'echantillons adaptables de synthese in-situ
WO2001031579A2 (fr) * 1999-10-27 2001-05-03 Barnhill Technologies, Llc Procedes et dispositifs permettant d'identifier des motifs dans des systemes biologiques et procedes d'utilisation correspondants

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6192360B1 (en) * 1998-06-23 2001-02-20 Microsoft Corporation Methods and apparatus for classifying text and for building a text classifier

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001005935A2 (fr) * 1999-07-16 2001-01-25 Rosetta Inpharmatics, Inc. Conception de sonde iterative et etablissement de profils d'expression detailles avec jeux ordonnes d'echantillons adaptables de synthese in-situ
WO2001031579A2 (fr) * 1999-10-27 2001-05-03 Barnhill Technologies, Llc Procedes et dispositifs permettant d'identifier des motifs dans des systemes biologiques et procedes d'utilisation correspondants

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
EISEN M: "Cluster and TreeView Manual"[Online] 1998, - 1999 pages 1-20, XP002402140 Retrieved from the Internet: URL:http://rana.lbl.gov/manuals/ClusterTreeView.pdf> [retrieved on 2006-10-06] *
See also references of WO03040949A1 *
YOUNG A N ET AL: "Expression profiling of renal epithelial neoplasms/A METHOD FOR TUMOR CLASSIFICATION AND DISCOVERY OF DIAGNOSTIC MOLECULAR MARKERS" AMERICAN JOURNAL OF PATHOLOGY, PHILADELPHIA, PA, US, vol. 158, no. 5, May 2001 (2001-05), pages 1639-1651, XP002962291 ISSN: 0002-9440 *

Also Published As

Publication number Publication date
WO2003040949A1 (fr) 2003-05-15
EP1449108A4 (fr) 2006-11-22

Similar Documents

Publication Publication Date Title
US7624074B2 (en) Methods for feature selection in a learning machine
US7318051B2 (en) Methods for feature selection in a learning machine
US7805388B2 (en) Method for feature selection in a support vector machine using feature ranking
US7475048B2 (en) Pre-processed feature ranking for a support vector machine
US8095483B2 (en) Support vector machine—recursive feature elimination (SVM-RFE)
Shmilovici Support vector machines
JP5064625B2 (ja) パターンを同定するための方法及び機械
US8463718B2 (en) Support vector machine-based method for analysis of spectral data
US7617163B2 (en) Kernels and kernel methods for spectral data
EP1192595B1 (fr) Amelioration de la decouverte de connaissances a partir d'ensembles de donnees multiples au moyen de machines a vecteurs de soutien multiples
EP1393196A1 (fr) Noyaux et procedes de selection de noyaux a utiliser dans des machines a enseigner
Osareh et al. Microarray data analysis for cancer classification
WO2001031579A2 (fr) Procedes et dispositifs permettant d'identifier des motifs dans des systemes biologiques et procedes d'utilisation correspondants
CA2435254C (fr) Procedes d'identification de motifs dans des systemes biologiques et utilisations desdits procedes
Douzas et al. Geometric SMOTE: Effective oversampling for imbalanced learning through a geometric extension of SMOTE
WO2003040949A1 (fr) Classement de caracteristiques pretraitees pour une machine a vecteur de support
AU2002253879A1 (en) Methods of identifying patterns in biological systems and uses thereof
AU764897B2 (en) Pre-processing and post-processing for enhancing knowledge discovery using support vector machines
Altınçay Decision trees using model ensemble-based nodes
Pérez-Sánchez et al. Selecting target concept in one-class classification for handling class imbalance problem
Pizzi et al. Classifying high-dimensional patterns using a fuzzy logic discriminant network
Jrad et al. Gene-based multiclass cancer diagnosis with class-selective rejections
Renukadevi et al. INVESTIGATING THE PERFORMANCE OF OPTIMIZATION TECHNIQUES WITH SVM FOR MEDICAL IMAGE CLASSIFICATION

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20040517

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LI LU MC NL PT SE SK TR

AX Request for extension of the european patent

Extension state: AL LT LV MK RO SI

RIN1 Information on inventor provided before grant (corrected)

Inventor name: GUYON, ISABELLE

Inventor name: PEREZ-CRUZ, FERNANDO,A.T.S.C.,UNIV. CAROLOS III

Inventor name: SCHOELKOPF, BERNHARD

Inventor name: ELLISSEEF, ANDRE

Inventor name: WESTON, JASON

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: BIOWULF TECHNOLOGIES, LLC

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: MCKENZIE, JOE

Owner name: CARLS, GARRY L.

Owner name: BERGERON, GLYNN

Owner name: O'HAYER, TIMOTHY P.

Owner name: SIMPSON, K. RUSSELL

Owner name: MATTHEWS, JOHN E.

Owner name: ANDERSON, CURTIS

Owner name: FARLEY, PETER J.

Owner name: PADEREWSKI, JULES B.

Owner name: ROBERTS, JAMES

Owner name: STERN, JULIAN N.

Owner name: MEMORIAL HEALTH TRUST, INC.

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: HEALTH DISCOVERY CORPORATION

RIC1 Information provided on ipc code assigned before grant

Ipc: G06F 15/18 20060101ALI20061010BHEP

Ipc: G06F 19/00 20060101AFI20061010BHEP

A4 Supplementary search report drawn up and despatched

Effective date: 20061019

17Q First examination report despatched

Effective date: 20070410

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: HEALTH DISCOVERY CORPORATION

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20110826