CN105825078B - Small-sample gene expression data classification method based on gene big data - Google Patents
Small-sample gene expression data classification method based on gene big data
- Publication number
- CN105825078B (application number CN201610150049.4A)
- Authority
- CN
- China
- Prior art keywords
- gene
- rank
- gene expression
- sample
- frequency
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
Landscapes
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Claims (5)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610150049.4A CN105825078B (zh) | 2016-03-16 | 2016-03-16 | Small-sample gene expression data classification method based on gene big data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610150049.4A CN105825078B (zh) | 2016-03-16 | 2016-03-16 | Small-sample gene expression data classification method based on gene big data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105825078A (zh) | 2016-08-03 |
CN105825078B (zh) | 2019-02-26 |
Family
ID=56523451
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610150049.4A (Active) CN105825078B (zh) | 2016-03-16 | 2016-03-16 | Small-sample gene expression data classification method based on gene big data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105825078B (zh) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
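CN108241792B (zh) * | 2016-12-23 | 2021-03-23 | BGI Tech Solutions Co., Ltd. | Method and device for integrating multi-platform genotyping results |
CN107016260B (zh) * | 2017-03-30 | 2019-09-13 | Guangdong University of Technology | Gene regulatory network reconstruction method based on cross-platform gene expression data |
CN108182347B (zh) * | 2018-01-17 | 2022-02-22 | Guangdong University of Technology | Large-scale cross-platform gene expression data classification method |
CN108985010B (zh) * | 2018-06-15 | 2022-04-08 | Henan Normal University | Gene classification method and device |
CN109754843B (zh) * | 2018-12-04 | 2021-02-19 | Genowis (Beijing) Gene Technology Co., Ltd. | Method and device for detecting small insertions and deletions in a genome |
CN110222745B (zh) * | 2019-05-24 | 2021-04-30 | Central South University | Cell type identification method based on similarity learning and its enhancement |
CN110706746B (zh) * | 2019-11-27 | 2021-09-17 | Beijing Boan Zhilian Technology Co., Ltd. | DNA mixed-typing database comparison algorithm |
CN111370124A (zh) * | 2020-03-05 | 2020-07-03 | Hunan City University | Health analysis system and method based on face and hand recognition and big data |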
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
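CN101624628A (zh) * | 2008-07-09 | 2010-01-13 | Sony Corporation | Gene detection method, gene detection program, and gene detection device |
CN101923604A (zh) * | 2010-07-23 | 2010-12-22 | Fujian Normal University | Weighted KNN tumor gene expression profile classification method based on neighborhood rough sets |
CN101921847A (zh) * | 2010-07-23 | 2010-12-22 | Fujian Normal University | Tumor gene expression profile classification method based on the fuzzy k-NN algorithm |
CN101996284A (zh) * | 2010-11-29 | 2011-03-30 | Kunming University of Science and Technology | Method for screening the characteristic genes of a given disease |
CN104156503A (zh) * | 2014-07-21 | 2014-11-19 | Jinhua Central Hospital | Disease risk gene identification method based on gene chip network analysis |
CN104408332A (zh) * | 2014-11-05 | 2015-03-11 | Shenzhen Institutes of Advanced Technology | Gene data processing method and device |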
Non-Patent Citations (5)
Title |
---|
"A Novel Approach for Classifying Human Cancers"; Shuqin Wang et al.; The 9th International Conference for Young Computer Scientists; 2008-11-21; pp. 976-981 |
"An entropy-based improved k-top scoring pairs (TSP) method for classifying human cancers"; Chunbao Zhou et al.; African Journal of Biotechnology; 2012-06-30; vol. 11, no. 45; pp. 10438-10445 |
"Simple decision rules for classifying human cancers from gene expression profiles"; Aik Choon Tan et al.; Bioinformatics; 2005-10-15; vol. 21, no. 20; pp. 3896-3904 |
"Two identification methods for feature genes based on decision forests"; Lü Sali et al.; Chinese Journal of Bioinformatics; 2014-09-20; vol. 2, no. 3; pp. 19-22 |
"Tumor subtype identification and classification based on gene expression profiles"; Li Yingxin et al.; Acta Electronica Sinica; 2005-04-25; vol. 33, no. 4; pp. 651-655 |
Also Published As
Publication number | Publication date |
---|---|
CN105825078A (zh) | 2016-08-03 |
Similar Documents
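Publication | Title |
---|---|
CN105825078B (zh) | Small-sample gene expression data classification method based on gene big data |
Aydadenta et al. | A clustering approach for feature selection in microarray data classification using random forest |
Es-saady et al. | Automatic recognition of plant leaves diseases based on serial combination of two SVM classifiers |
CN103632168B (zh) | Classifier ensemble method in machine learning |
CN109145097A (zh) | Judgment document classification method based on information extraction |
CN109492673A (zh) | Imbalanced data prediction method based on spectral clustering sampling |
Beikmohammadi et al. | SWP-LeafNET: A novel multistage approach for plant leaf identification based on deep CNN |
CN103761295B (zh) | Customized feature extraction method for art pictures based on automatic picture classification |
Spiliopoulou et al. | Higher order mining: Modelling and mining the results of knowledge discovery |
CN102521605A (zh) | Band selection method for hyperspectral remote sensing images |
Sidiropoulos et al. | Gazing at the skyline for star scientists |
Lan et al. | Position-Aware ListMLE: A Sequential Learning Process for Ranking. |
CN102663447B (zh) | Cross-media retrieval method based on discriminant correlation analysis |
CN103077399B (zh) | Biological microscopic image classification method based on an integrated cascade architecture |
CN102254033A (zh) | Global k-means clustering method based on entropy weights |
CN110738053A (zh) | News topic recommendation algorithm based on semantic analysis and a supervised learning model |
CN104966075B (zh) | Face recognition method and system based on two-dimensional discriminant features |
CN104850868A (zh) | Customer segmentation method based on k-means and neural network clustering |
Pouyan et al. | Clustering single-cell expression data using random forest graphs |
CN106548041A (zh) | Tumor key gene identification method based on prior information and a parallel binary particle swarm algorithm |
CN109800790A (zh) | Feature selection method for high-dimensional data |
CN106570537A (zh) | Random forest model selection method based on the confusion matrix |
CN106557785A (zh) | Support vector machine method for optimizing data classification |
CN110189799B (zh) | Metagenomic feature selection method based on variable importance scores and the Neyman-Pearson test |
CN107392249A (zh) | Density peak clustering method with optimized k-nearest-neighbor similarity |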
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2020-09-15; Address after: 528000 Jiangwan Road, Guangdong, No. 18, No.; Co-patentee after: Guangdong University of Technology; Patentee after: Foshan University; Address before: 510006 Panyu District, Guangzhou, Guangzhou University, West Ring Road, No. 100; Patentee before: Guangdong University of Technology |
CP03 | Change of name, title or address | Address after: 528000 No. 18, No. 1, Jiangwan, Guangdong, Foshan; Patentee after: Foshan University; Country or region after: China; Patentee after: Guangdong University of Technology; Address before: 528000 No. 18, No. 1, Jiangwan, Guangdong, Foshan; Patentee before: Foshan University; Country or region before: China; Patentee before: Guangdong University of Technology |