WO2003072701A1 - Systeme destine a analyser des puces a adn au moyen d'une ontologie genetique et methode associee - Google Patents

Systeme destine a analyser des puces a adn au moyen d'une ontologie genetique et methode associee Download PDF

Info

Publication number
WO2003072701A1
WO2003072701A1 PCT/KR2003/000400 KR0300400W WO03072701A1 WO 2003072701 A1 WO2003072701 A1 WO 2003072701A1 KR 0300400 W KR0300400 W KR 0300400W WO 03072701 A1 WO03072701 A1 WO 03072701A1
Authority
WO
WIPO (PCT)
Prior art keywords
optimal branch
gene
distance
pseudo
cluster
Prior art date
Application number
PCT/KR2003/000400
Other languages
English (en)
Inventor
Yang-Suk Kim
Jung-Uk Hur
Sung-Geun Lee
Original Assignee
Istech Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Istech Co., Ltd. filed Critical Istech Co., Ltd.
Priority to AU2003212669A priority Critical patent/AU2003212669A1/en
Publication of WO2003072701A1 publication Critical patent/WO2003072701A1/fr

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/10Gene or protein expression profiling; Expression-ratio estimation or normalisation
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6834Enzymatic or biochemical coupling of nucleic acids to a solid phase
    • C12Q1/6837Enzymatic or biochemical coupling of nucleic acids to a solid phase using probe arrays or probe chips

Definitions

  • the present invention relates to a system for DNA microarray analysis using
  • Gene OntologyTM and a method thereof and more specifically to a system and a method for biologically analyzing a gene expression pattern of a DNA chip or microarray assays by modeling of a hierarchical structure of gene ontology (hereinafter referred to as "GO") .
  • GO hierarchical structure of gene ontology
  • Biochips are broadly divided into microarray chips and microfluidics chips.
  • a microarray chip contains thousands of or tens of thousands of DNA or protein samples arranged at regular intervals, and thus can process analyte to identify its binding pattern.
  • Microarray chips generally refer to DNA chips and protein chips. DNA chips have been the most dominant biochips up to date. Microfluidics chips pass over a small amount of analyte in controlled flow and analyze the reaction pattern of the analyte with the molecule on a chip or with a sensor.
  • DNA chips are made by spotting a target DNA, cDNA or oligonucleotide on a glass slide, nitrocellulose membrane or silicon.
  • DNA chips consist of a small-sized solid on which cDNA or oligonucleotide probes with known base sequences are micro-arrayed at predetermined positions.
  • DNA chips if hybridized with a probe labeled with a radioactive isotope or fluorescent dye, can be used in identification of gene mutations and levels of gene expression, single nucleotide polymorphism (SNP), diagnosis of diseases, high-throughput screening (HTS) and so on.
  • SNP single nucleotide polymorphism
  • HTS high-throughput screening
  • a sample DNA fragment to be analyzed is combined to a DNA chip, the probe affixed to the DNA chip and the base sequence of the sample DNA fragment are hybridized depending on the level of complementarity. It is possible to analyze the base sequence of the sample DNA by detecting and understanding the hybridization by an optical or radioactive chemical method. If DNA chips are utilized, expression information of genes can be easily and rapidly obtained. DNA chips are now used for development of new drugs and medical diagnosis.
  • the Swiss-Prot provides protein information, classifies the functions of proteins by keywords.
  • keywords used for the classification, which makes it difficult to perform automated biological analyses of DNA chips.
  • the group information of the particular fields such as the CGAP (Cancer Genome Anatomy Project)
  • CGAP Cancer Genome Anatomy Project
  • Ontology means a classification system for biological terms and vocabularies.
  • the goal of the Gene Ontology Consortium is to construct a controlled and unified system of biological terms.
  • the Consortium provides about 10,000 dynamic controlled terms - the number of terms is vaiying when necessary - that can be applied to describe the roles of genes and gene products in all organisms.
  • Gene OntologyTM(GO) shows the relationships between genes and the keywords assigned for each gene and it is applicable to bioinformatics.
  • the GO terms amounting to about 10,000, have a tree-like hierarchical structure called DAG(Directed Acyclic Graph) and are divided into three categories.
  • the GO terms can be used to find a biological meaning when analyzing a DNA chip.
  • GO terms are classified into three categories reflecting the biological roles of genes: i) molecular function, ii) biological process and iii) cellular component.
  • Hierarchically controlled vocabularies are established for each category.
  • the three categories are not exclusive but all descriptive of a gene.
  • the present invention has been made in view of the above-mentioned problems, and it is an object of the present invention to provide a system and a method for DNA chip analysis utilizing Gene OntologyTM to enable a systematic biological analysis of a gene expression pattern of a DNA chip test by modeling of a GO hierarchy.
  • Another object of the present invention is to provide a method for extracting representative functions which are most common and ideal among the genes contained in a cluster formed by statistical clustering of DNA chip test results, utilizing GO terms and hierarchical tree structure.
  • FIG. 1 is a view showing the construction of a system for DNA chip analysis using Gene OntologyTM according to the present invention.
  • FIG. 2 shows one example of the GO tree structure according to the present invention.
  • FIG. 3 shows one example of a modification of the GO tree structure in text format according to the present invention.
  • FIG. 4 shows one example of a conversion of extracted GO codes according to the present invention.
  • FIG. 5 is a view briefly showing the principle of finding an optimal branch using GO according to the present invention.
  • FIG. 6 is a view showing the principle of measuring a pseudo-distance according to the present invention.
  • FIG. 7 is an operation flow of the analysis of a DNA chip using GO according to the present invention.
  • a system for DNA chip analysis using Gene OntologyTM comprising: a) means for receiving statistical clustering results of DNA chip data and for assigning appropriate GO identifiers to each gene pertaining to given cluster; b) means for converting each GO identifier assigned to said gene into a GO code using a GO code file; c) means for selecting a proper process among three predetermined process that adopt pseudo-distance to designate necessary parameters and for extracting an optimal branch; and d) means for extracting biological meanings from each extracted optimal branch; and e) optionally, visualizing means for displaying the optimal branch, GO code and biological meanings of a given cluster of gene.
  • the visualizing means displays summarized information on the GO code, optimal branch and biological meanings of a given cluster of genes in a form of table, or in a form of a graphical tree structure.
  • the optimal branch gives a proper weight to each level of the GO tree structure.
  • the pseudo-distance "Pd(vl,v2)," wherein vl and v2 represent nodes, is a weight of the level corresponding to the GO code of the optimal branch formed by nodes vl and v2.
  • the Pd value is zero.
  • max_pd(G) max ⁇ pd(Vj,Vj) ⁇ (1 ⁇ i ⁇ j ⁇ n)
  • aver_pd(G) (sum of all pd(v ⁇ ,V j ) in G)/ n
  • C 2 2 x (sum of all pd(vj,V j ) in G)/n(n-l)
  • max_pd max_pd
  • aver_pd aver_pd
  • the average pseudo-distance (aver_pd) shows how well genes are clustered with the same functional categories, and how frequently similar codes are observed.
  • the predetermined process of the means for extracting an optimal branch comprises i) basic process, ii) N-level selective process and iii) percentage selective process.
  • the means designates a proper process among them and necessary parameters to extract an optimal branch.
  • the basic process utilizes the maximum pseudo-distance (max_pd) and average pseudo-distance (aver_pd) of all nodes in the GO tree structure. The results obtained by the basic process roughly show the biological meanings of a given cluster.
  • the N-level selective process predesignates and computes each level of the optimal branch, observes formation of the optimal branch at a particular level N and analogizes the biological meaning at a lower level.
  • the percentage selective process predesignates the percentage of genes pertaining to the optimal branch and shows all combinations of genes in percentages desired by a user.
  • the N-level selective process shows both the first candidate of GO code combination and the next candidate of combinations to reflect the diversity that a single gene can be involved in two or more functions.
  • the predetermined process comprises i) basic process, ii) N-level selective process and iii) percentage selective process. Proper one of these processes and necessary parameters are designated to extract an optimal branch.
  • FIG. 1 is a view showing the construction of a system for DNA chip analysis using GO.
  • the system for DNA chip analysis comprises: an input section(ll ⁇ ) for inputting statistical clustering results of said DNA chip data; GO identifier assigning section(130) for assigning GO identifiers to each gene pertaining to each cluster for inputting clustering results using a GO identifier index file(120); GO identifier/GO code converting section(140) for converting each GO identifier assigned to the corresponding genes into a GO code utilizing GO code file; an optimal branch extracting section(220) for selecting a predetermined process according to pseudo-distance algorithm(210) to designate a necessary parameter for said GO code and extracting an optimal branch; and biological meanings extracting section(230) for extracting biological meanings from each optimal branch.
  • the system may further comprise a visualization module(310) for displaying the extracted optimal branch, the GO code and the biological meanings over a cluster.
  • the present invention assigns GO terms to each gene, extracts an optimal branch by mathematically utilizing the GO hierarchical tree structure, and efficiently displays the results of the optimal branch extraction.
  • FIG. 2 shows an example of the GO tree structure according to the present invention.
  • the highest level refers to GO level.
  • the second level refers to three categories, i.e., molecular function, biological process and cellular component.
  • the lower levels (3 rd , 4 th and 5 th ) form a tree-like inheritance structure.
  • FIG. 3 shows an example of a modification of the GO tree in a text format according to the present invention.
  • GO is not a tree structure in original form but a mathematical graph called DAG, directed graph without cycle.
  • the GO structure can be simply changed to a GO tree structure in the present invention.
  • FIG. 4 shows an example of conversion from GO terms to GO codes according to the present invention. This drawing illustrates the outputs of GO codes converted by the GO code converting section(140).
  • An optimal branch refers to the lowest nodes among the nodes including the greatest number of genes at the bottom in a tree structure.
  • the optimal branch is a broad term representing all the functions of genes included in the nodes at the bottom.
  • the system of the present invention assigns genes pertaining to a given cluster in the GO tree structure, finds the optimal branch through the pseudo-distance algorithm and displays the results.
  • GO terms are assigned to corresponding genes by text mining of various biological databases. For the allocation of GO terms, such information, at the DNA level or the protein level as provided by UniGene, LocusLink, Swiss-Prot and MGI, is utilized together with direct comparison of identifiers and sequence similarity searches. Also, gene identifier conversion files provided by each database that participated in the GO Consortium are utilized to assign GO terms.
  • UniGene of the NCBI National Center of Biotechnology Information
  • LocusLink which is the result of the reference sequence project of the NCBI, provides information about the functions of genes and representative sequences.
  • Swiss-Prot of the Swiss Institute of Bioinformatics provides gene information at the protein level.
  • MGI Mae Genome Informatics
  • each GO code is a sequence of numbers, here 15 numbers. Note that the sequence length 15 is variable according to the versions of GO syntax files. Each number in a GO code sequence represents the positional information at each step. Since a unique GO code is assigned to each node in GO tree, GO terms are distinguishable even if the same terms are used in different nodes in the GO tree structure.
  • FIG. 5 is a view for briefly explaining the principle of finding an optimal branch using GO tree according to the present invention.
  • the optimal branch can be found using GO codes.
  • a weight and a pseudo-distance between GO codes is defined.
  • FIG. 6 is a view for explaining the principle of measuring a pseudo-distance between nodes according to the present invention.
  • the pseudo-distance is defined as follows.
  • Pd(vl,v2) is a weight of the level at which there exists the optimal branch formed by nodes vl and v2. When vl and v2 are the same, the Pd value is zero. Ultimately, a combination of GO codes is selected through the following pseudo-distance concept.
  • pd(vl,v2) weight of the level where the optimal branch between vl and v2 is located(when vl ⁇ v2)
  • maximum pseudo-distance (max_pd) is used to roughly evaluate clusters. If the optimal branch of a cluster is located at a higher level, the cluster is likely to include bad genes which do not share the common characteristics with the other genes in that cluster.
  • the average pseudo-distance (aver_pd) shows how well genes are clustered in a given cluster with similar functional categories, and how frequently similar GO codes are observed.
  • the pseudo-distance is applicable to three processes: a basic process, an
  • N-level selective process and a percentage selective process N-level selective process and a percentage selective process.
  • the basic process has two modules using the maximum pseudo-distance (max_pd) and average pseudo-distance (aver _pd) of all nodes in GO tree structure.
  • the results of basic process show the overall biological meanings of a given cluster.
  • a user can designate particular limits.
  • the N-level selective process predesignates the level of an optimal branch so that the formation of the optimal branch at a particular level N can be easily computed.
  • the N-level selective process enables the user to easily analogize the biological meanings at a lower level, which is not possible in the basic process.
  • the N-level selective process shows both the first candidate of GO code combination and the next candidates of combinations to reflect the diversity that a single gene can be involved in two or more functions.
  • the percentage selective process predesignates the percentage of genes pertaining to the optimal branch and finds all combinations of genes in percentages desired by a user. Like the N-level selective process, the percentage selective process can fully show the functional diversity of genes.
  • FIG. 7 is the flow diagram of DNA chip analysis using GO according to the present invention.
  • the method of analysis comprises the steps of: receiving statistical clustering results of DNA chip data (S10) and assigning GO identifiers to each gene pertaining to given cluster (S20); converting each GO identifier assigned to the corresponding genes into a GO code using GO code file (S30); selecting a process among basic process (S41), N-level selective process (S42) and percentage selective process (S43) according to pseudo-distance algorithm (S40) to designate a necessary parameter for said GO code and extracting an optimal branch (S50); extracting a biological meaning of each extracted optimal branch (S60); and displaying an optimal branch of a cluster and its GO code (S70).
  • the system for biological analysis of the gene expression pattern of DNA chip using GO structure according to the present invention is comprised of three broad sections 100, 200 and 300. The operation of each section will be described in detail by reference to FIG. 7.
  • GO identifiers and their GO codes are assigned to each gene in a given cluster that is obtained from a statistical clustering method. More specifically, when clustering results are inputted (S10), GO identifiers are assigned to each gene within a cluster (S20) based on the index file that has previously assigned GO identifiers to genes through data mining of various databases. Subsequently, each GO identifier assigned to genes in a given cluster is converted into a GO code (S30) using the GO code file which all nodes in GO tree structures are all coded.
  • a proper process among basic process (S41), N-level selective process (S42) and percentage selective process (S43) is chosen using pseudo algorithm, and necessary parameters are designated.
  • An optimal branch is then computed (S50) based on the pseudo distance in each process. Also, biological meanings of the optimal branch are extracted.
  • the optimal branch extracted for genes in each cluster and the GO code assigned to the genes are displayed. Summarized information on the GO code for each gene, the optimal branch and the biological meanings can be displayed in the form of a table or a graphical tree.
  • the pseudo algorithm is also applicable to a different biochip, protein chip.
  • the pseudo-distance algorithm can be utilized to analyze a protein chip in the same way as utilized to analyze a DNA chip in FIGs. 1 and 7.
  • the present invention enables a systematic and automated biological analysis of gene expression patterns of DNA chip assays by a mathematical modeling of GO hierarchy. Also, the present invention can extract the biological functions that are commonest and most optimal among genes within a cluster formed by a statistical clustering method of DNA chip data, utilizing GO terms and tree structure.

Landscapes

  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Medical Informatics (AREA)
  • Genetics & Genomics (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Physiology (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

La présente invention concerne un système et une méthode destinés à l'analyse biologique de motifs d'expression génétique de puces à ADN ou de microréseaux à ADN au moyen d'une modélisation mathématique d'une structure hiérarchique Gene Ontology™. L'invention concerne notamment un système destiné à l'analyse de données de puces à ADN au moyen d'une ontologie Gene Ontology™(GO), et comprenant (a) des moyens permettant de recevoir des résultats d'agrégation statistique de données de puces à ADN et d'attribuer des identificateurs GO appropriés à chaque gène appartenant à un groupe donné, (b) des moyens destinés à convertir chaque identificateur GO attribué à ce gène en un code GO utilisant un fichier de code GO, (c) des moyens servant à sélectionner un processus approprié parmi trois processus prédéterminés adoptant une pseudo-distance pour désigner des paramètres nécessaires, ainsi qu'à extraire une branche optimale, (d) des moyens destinés à extraire des significations biologiques à partir de chaque branche optimale extraite, et, éventuellement, (e) des moyens de visualisation permettant d'afficher la branche optimale, le code GO et les significations biologiques d'un groupe de gènes donné. La présente invention permet une analyse systématique et automatisée de motifs d'expression génétique d'une puce à ADN par modélisation d'une hiérarchie GO.
PCT/KR2003/000400 2002-02-28 2003-02-28 Systeme destine a analyser des puces a adn au moyen d'une ontologie genetique et methode associee WO2003072701A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2003212669A AU2003212669A1 (en) 2002-02-28 2003-02-28 A system for analyzing dna-chips using gene ontology and a method thereof

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2002-0010826 2002-02-28
KR10-2002-0010826A KR100431620B1 (ko) 2002-02-28 2002-02-28 유전자 어휘 분류체계를 이용하여 디엔에이 칩을 분석하기위한 시스템 및 그 방법

Publications (1)

Publication Number Publication Date
WO2003072701A1 true WO2003072701A1 (fr) 2003-09-04

Family

ID=27764625

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2003/000400 WO2003072701A1 (fr) 2002-02-28 2003-02-28 Systeme destine a analyser des puces a adn au moyen d'une ontologie genetique et methode associee

Country Status (3)

Country Link
KR (1) KR100431620B1 (fr)
AU (1) AU2003212669A1 (fr)
WO (1) WO2003072701A1 (fr)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102567314A (zh) * 2010-12-07 2012-07-11 中国电信股份有限公司 知识查询装置和方法
US8396872B2 (en) 2010-05-14 2013-03-12 National Research Council Of Canada Order-preserving clustering data analysis system and method
CN103366098A (zh) * 2013-07-24 2013-10-23 国家电网公司 一种基于实验资源树的实验能力定量评价方法

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20050096044A (ko) * 2004-03-29 2005-10-05 주식회사 이즈텍 유전자 기능 분석 방법
US7848890B2 (en) 2004-12-08 2010-12-07 Electronics And Telecommunications Research Institute Method and system for predicting gene pathway using gene expression pattern data and protein interaction data
KR100849497B1 (ko) * 2006-09-29 2008-07-31 한국전자통신연구원 온톨로지 매핑을 이용한 단백질 이름 정규화 방법
KR100836865B1 (ko) * 2006-09-29 2008-06-11 고려대학교 산학협력단 마이크로어레이 실험 정보의 통합 관리 방법 및 그기록매체
KR100897523B1 (ko) * 2006-12-05 2009-05-15 한국전자통신연구원 유전자 상동성 정보를 이용한 유전자 리스트의 생체패스웨이 할당 장치 및 그 방법
KR101067352B1 (ko) * 2009-11-19 2011-09-23 한국생명공학연구원 생물학적 네트워크 분석을 이용한 마이크로어레이 실험 자료의 작용기작, 실험/처리 조건 특이적 네트워크 생성 및 실험/처리 조건 관계성 해석을 위한 알고리즘을 포함한 시스템 및 방법과 상기 방법을 수행하기 위한 프로그램을 갖는 기록매체
KR101151785B1 (ko) * 2010-01-18 2012-05-31 한국기초과학지원연구원 유전자 명명법을 이용한 상동체 유전자의 발굴 방법
CN116150864B (zh) * 2023-04-25 2023-07-04 中国建筑第五工程局有限公司 一种从bim模型中自动生成建筑结构分析模型的方法

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5887120A (en) * 1995-05-31 1999-03-23 Oracle Corporation Method and apparatus for determining theme for discourse
WO1999039174A2 (fr) * 1998-01-29 1999-08-05 Yissum Research Development Company Of The Hebrew University Of Jerusalem Procede automatique de classification de molecules
WO1999067727A1 (fr) * 1998-06-25 1999-12-29 Microsoft Corporation Procede et systeme de visualisation d'agregats et de classifications

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5887120A (en) * 1995-05-31 1999-03-23 Oracle Corporation Method and apparatus for determining theme for discourse
WO1999039174A2 (fr) * 1998-01-29 1999-08-05 Yissum Research Development Company Of The Hebrew University Of Jerusalem Procede automatique de classification de molecules
WO1999067727A1 (fr) * 1998-06-25 1999-12-29 Microsoft Corporation Procede et systeme de visualisation d'agregats et de classifications

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BAKER P.G. ET AL.: "An ontology for bioinformatics applications", BIOINFORMATICS, vol. 15, no. 6, 1999, pages 510 - 520, XP002230457, DOI: doi:10.1093/bioinformatics/15.6.510 *
BERTONE P., GERSTEIN M.: "Integrative data mining: the new direction in bioinformatics", ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, IEEE, vol. 20, no. 4, July 2001 (2001-07-01) - August 2001 (2001-08-01), pages 33 - 40 *
PATON N.W. ET AL.: "A query processing in the TAMBIS bioinformatics source integration system", PROC. 11TH INT. CONF. ON SCIENTIFIC AND STATISTICAL DATABASES (SSDBM), IEEE PRESS, 1999, pages 138 - 147, XP010348735, DOI: doi:10.1109/SSDM.1999.787629 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8396872B2 (en) 2010-05-14 2013-03-12 National Research Council Of Canada Order-preserving clustering data analysis system and method
CN102567314A (zh) * 2010-12-07 2012-07-11 中国电信股份有限公司 知识查询装置和方法
CN102567314B (zh) * 2010-12-07 2015-03-04 中国电信股份有限公司 知识查询装置和方法
CN103366098A (zh) * 2013-07-24 2013-10-23 国家电网公司 一种基于实验资源树的实验能力定量评价方法

Also Published As

Publication number Publication date
AU2003212669A1 (en) 2003-09-09
KR20030071225A (ko) 2003-09-03
KR100431620B1 (ko) 2004-05-17

Similar Documents

Publication Publication Date Title
Dubitzky et al. Introduction to microarray data analysis
US9147037B2 (en) Automated analysis of multiplexed probe-target interaction patterns: pattern matching and allele identification
US20040049354A1 (en) Method, system and computer software providing a genomic web portal for functional analysis of alternative splice variants
US20020183936A1 (en) Method, system, and computer software for providing a genomic web portal
JP5464503B2 (ja) 医療分析システム
US20060142949A1 (en) System, method, and computer program product for dynamic display, and analysis of biological sequence data
EP1222602A2 (fr) Systeme d'intelligence artificielle pour l'analyse genetique
US20150310165A1 (en) Efficient comparison of polynucleotide sequences
WO2003072701A1 (fr) Systeme destine a analyser des puces a adn au moyen d'une ontologie genetique et methode associee
US20070143031A1 (en) Method of analyzing a bio chip
US20040030504A1 (en) System, method, and computer program product for the representation of biological sequence data
WO2006001896A2 (fr) Puce a adn universelle pour analyse chimiogenomique a haut rendement
Chen et al. How will bioinformatics impact signal processing research?
Saei et al. A glance at DNA microarray technology and applications
Ho et al. DNA microarrays in prostate cancer
Zubi et al. Sequence mining in DNA chips data for diagnosing cancer patients
US6994965B2 (en) Method for displaying results of hybridization experiment
KR20050096044A (ko) 유전자 기능 분석 방법
US20040073527A1 (en) Method, system and computer software for predicting protein interactions
Agapito et al. A software pipeline for multiple microarray data analysis
Brush Making sense of microchip array data
Monforte et al. Strategy for gene expression-based biomarker discovery
Zubi et al. Using sequence DNA chips data to Mining and Diagnosing Cancer Patients
Yang et al. Multiagent framework for bio-data mining
Stubbs et al. Microarray bioinformatics

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP