WO2001016805A3 - A system and method for mining data from a database using relevance networks

A system and method for mining data from a database using relevance networks

Publication number
WO2001016805A3
Authority
WO
WIPO (PCT)
Prior art keywords
data
variables
association
strength
relevance
Prior art date
Application number
PCT/US2000/024257
Other languages
French (fr)
Other versions
WO2001016805A2 (en)
Inventor
Atul Janardhan Butte
Isaac S Kohane
Original Assignee
Childrens Medical Center
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Childrens Medical Center
Priority to CA002383549A (CA2383549A1)
Priority to EP00959855A (EP1266305A2)
Priority to AU71105/00A (AU7110500A)
Priority to JP2001520685A (JP2003527662A)
Publication of WO2001016805A2
Publication of WO2001016805A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28 Databases characterised by their database models, e.g. relational or object models
    • G06F16/284 Relational databases
    • G06F16/288 Entity relationship models
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00 ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00 Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/03 Data mining
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00 ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Medical Informatics (AREA)
  • Data Mining & Analysis (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • Epidemiology (AREA)
  • Evolutionary Computation (AREA)
  • Bioethics (AREA)
  • Public Health (AREA)
  • Software Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Described are a system and method for mining data in databases to discover significant relationships among variables in the data. An association is established between each pair of variables. From the data, the strength of each association is calculated. In one embodiment, correlation coefficients determine the strength of the associations. In another embodiment, the strength of each association is computed according to mutual information. The calculated strengths are evaluated against a predetermined criterion. All associations that satisfy the criterion are included in one or more relevance networks. Each relevance network is displayed to provide a pictorial view of the relevant relationships among variables in the data.
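
The steps in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the function names `pearson` and `relevance_networks`, the use of the squared Pearson correlation coefficient as the association strength, the fixed `threshold` as the predetermined criterion, and the sample data are all assumptions made for illustration. Each relevance network is taken here to be a connected group of variables joined by associations that satisfy the criterion.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # assumption: a constant variable has no association
    return cov / sqrt(var_x * var_y)


def relevance_networks(data, threshold):
    """Build relevance networks from pairwise association strengths.

    data: dict mapping a variable name to its list of observations.
    threshold: hypothetical criterion; a pair is kept when r**2 >= threshold.
    Returns (edges, networks): the kept pairs with their strengths, and the
    connected groups of variables formed by those pairs.
    """
    # Step 1: establish an association for every pair and score it.
    edges = {}
    for a, b in combinations(sorted(data), 2):
        strength = pearson(data[a], data[b]) ** 2
        # Step 2: keep only associations that satisfy the criterion.
        if strength >= threshold:
            edges[(a, b)] = strength

    # Step 3: collect the kept associations into an adjacency map.
    neighbours = {}
    for a, b in edges:
        neighbours.setdefault(a, set()).add(b)
        neighbours.setdefault(b, set()).add(a)

    # Step 4: each connected component is one relevance network.
    networks, seen = [], set()
    for start in sorted(neighbours):
        if start in seen:
            continue
        component, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(neighbours[node] - component)
        seen |= component
        networks.append(sorted(component))
    return edges, networks


# Made-up observations for four variables.
data = {
    "a": [1, 2, 3, 4, 5],
    "b": [2, 4, 6, 8, 10],
    "c": [5, 3, 4, 1, 2],
    "d": [1, 0, 1, 0, 1],
}
edges, networks = relevance_networks(data, threshold=0.9)
print(networks)
```

In this sketch, variables with no association above the threshold appear in no network, and lowering the threshold merges more variables into larger networks. For the mutual information embodiment, the same structure would apply with `pearson` swapped for a mutual information estimate.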
PCT/US2000/024257 1999-09-02 2000-09-01 A system and method for mining data from a database using relevance networks WO2001016805A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
CA002383549A CA2383549A1 (en) 1999-09-02 2000-09-01 A system and method for mining data from a database using relevance networks
EP00959855A EP1266305A2 (en) 1999-09-02 2000-09-01 A system and method for mining data from a database using relevance networks
AU71105/00A AU7110500A (en) 1999-09-02 2000-09-01 A system and method for mining data from a database using relevance networks
JP2001520685A JP2003527662A (en) 1999-09-02 2000-09-01 System and apparatus for retrieving data from a database using an associated network

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US60/152,500 (US15250099P) 1999-09-02 1999-09-02
US60/153,593 (US15359399P) 1999-09-13 1999-09-13
US09/430,450 (US43045099A) 1999-10-29 1999-10-29

Publications (2)

Publication Number Publication Date
WO2001016805A2 WO2001016805A2 (en) 2001-03-08
WO2001016805A3 WO2001016805A3 (en) 2002-09-26

Family

ID=27387262

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/024257 WO2001016805A2 (en) 1999-09-02 2000-09-01 A system and method for mining data from a database using relevance networks

Country Status (5)

Country Link
EP (1) EP1266305A2 (en)
JP (1) JP2003527662A (en)
AU (1) AU7110500A (en)
CA (1) CA2383549A1 (en)
WO (1) WO2001016805A2 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4199026B2 (en) * 2003-03-03 2008-12-17 富士通株式会社 Information relevance display method, program, storage medium, and apparatus
US7647287B1 (en) 2008-11-21 2010-01-12 International Business Machines Corporation Suggesting a relationship for a node pair based upon shared connections versus total connections
CN107025596B (en) 2016-02-01 2021-07-16 腾讯科技(深圳)有限公司 Risk assessment method and system
US11599840B2 (en) * 2018-02-25 2023-03-07 Graphen, Inc. System for discovering hidden correlation relationships for risk analysis using graph-based machine learning

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
ALON U ET AL: "BROAD PATTERNS OF GENE EXPRESSION REVEALED BY CLUSTERING ANALYSIS OF TUMOR AND NORMAL COLON TISSUES PROBED BY OLIGONUCLEOTIDE ARRAYS", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA, NATIONAL ACADEMY OF SCIENCE. WASHINGTON, US, vol. 96, June 1999 (1999-06-01), pages 6745 - 6750, XP002901769, ISSN: 0027-8424 *
BASETT D E ET AL: "GENE EXPRESSION INFORMATICS - IT'S ALL IN YOUR MINE", NATURE GENETICS, NEW YORK, NY, US, vol. 21, no. SUPPL, January 1999 (1999-01-01), pages 51 - 55, XP000865988, ISSN: 1061-4036 *
CHEN T ET AL: "Identifying Gene Regulatory Networks from Experimental Data", RECOMB 99, ACM PRESS, April 1999 (1999-04-01), USA, XP002189969 *
CHEN Y ET AL: "CLUSTERING ANALYSIS FOR GENE EXPRESSION DATA", PROCEEDINGS OF THE SPIE, SPIE, BELLINGHAM, VA, US, vol. 3602, January 1999 (1999-01-01), pages 422 - 428, XP001001103 *
CLAVERIE, J.-M.: "Computational methods for the identification of differential and coordinated gene expression", HUMAN MOLECULAR GENETICS, OXFORD UNIVERSITY PRESS, vol. 8, no. 10, 1 September 1999 (1999-09-01), pages 1821 - 1832, XP002202819 *
D'HAESELEER, P. ET AL: "Gene Expression Data Analysis and Modeling", PACIFIC SYMPOSIUM ON BIOCOMPUTING 1999 (PSB99), TUTORIAL, 4 January 1999 (1999-01-04) - 9 January 1999 (1999-01-09), Hawaii, USA, pages 1 - 34, XP002203022 *
EISEN M B ET AL: "Cluster analysis and display of genome-wide expression patterns", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA, NATIONAL ACADEMY OF SCIENCE. WASHINGTON, US, vol. 95, December 1998 (1998-12-01), pages 14863 - 14868, XP002140966, ISSN: 0027-8424 *
MICHAELS G S ET AL: "CLUSTER ANALYSIS AND DATA VISUALIZATION OF LARGE-SCALE GENE EXPRESSION DATA", PROCEEDINGS OF THE PACIFIC SYMPOSIUM ON BIOCOMPUTING, XX, XX, 1997, pages 42 - 53, XP000974575 *

Also Published As

Publication number Publication date
CA2383549A1 (en) 2001-03-08
AU7110500A (en) 2001-03-26
EP1266305A2 (en) 2002-12-18
WO2001016805A2 (en) 2001-03-08
JP2003527662A (en) 2003-09-16

Legal Events

Date Code Title Description
AK Designated states - Kind code of ref document: A2; Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW
AL Designated countries for regional patents - Kind code of ref document: A2; Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG
121 EP: the EPO has been informed by WIPO that EP was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101)
WWE WIPO information: entry into national phase - Ref document number: 2383549; Country of ref document: CA
ENP Entry into the national phase - Ref country code: JP; Ref document number: 2001 520685; Kind code of ref document: A; Format of ref document f/p: F
WWE WIPO information: entry into national phase - Ref document number: 71105/00; Country of ref document: AU
WWE WIPO information: entry into national phase - Ref document number: 2000959855; Country of ref document: EP
REG Reference to national code - Ref country code: DE; Ref legal event code: 8642
AK Designated states - Kind code of ref document: A3; Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW
AL Designated countries for regional patents - Kind code of ref document: A3; Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG
WWP WIPO information: published in national office - Ref document number: 2000959855; Country of ref document: EP
WWW WIPO information: withdrawn in national office - Ref document number: 2000959855; Country of ref document: EP