CN111785321B - DNA binding residue prediction method based on deep convolutional neural network - Google Patents
DNA binding residue prediction method based on deep convolutional neural network Download PDFInfo
- Publication number
- CN111785321B CN111785321B CN202010533489.4A CN202010533489A CN111785321B CN 111785321 B CN111785321 B CN 111785321B CN 202010533489 A CN202010533489 A CN 202010533489A CN 111785321 B CN111785321 B CN 111785321B
- Authority
- CN
- China
- Prior art keywords
- residue
- protein sequence
- layer
- convolutional neural
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000004568 DNA-binding Effects 0.000 title claims abstract description 26
- 238000013527 convolutional neural network Methods 0.000 title claims abstract description 25
- 238000000034 method Methods 0.000 title claims abstract description 23
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 63
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 63
- 239000011159 matrix material Substances 0.000 claims abstract description 26
- 230000027455 binding Effects 0.000 claims abstract description 15
- 238000012549 training Methods 0.000 claims description 33
- 230000004913 activation Effects 0.000 claims description 4
- 238000010606 normalization Methods 0.000 claims description 4
- 238000011176 pooling Methods 0.000 claims description 4
- 238000004364 calculation method Methods 0.000 abstract description 4
- 239000003446 ligand Substances 0.000 abstract description 4
- 230000008569 process Effects 0.000 abstract description 2
- 230000006870 function Effects 0.000 description 10
- 108020004414 DNA Proteins 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 239000002253 acid Substances 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 102000038037 druggable proteins Human genes 0.000 description 1
- 108091007999 druggable proteins Proteins 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Software Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Biomedical Technology (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Medical Informatics (AREA)
- Evolutionary Biology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Genetics & Genomics (AREA)
- Analytical Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- Bioethics (AREA)
- Databases & Information Systems (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
一种基于深度卷积神经网络的DNA绑定残基预测方法,首先,根据输入的残基数为L待进行配体绑定残基预测的蛋白质序列信息,使用psi‑blast程序和PSSpred程序获取矩阵PSSM和PSS;然后,将两个矩阵组合为一个特征矩阵F;其次,我们将蛋白质序列处理成残基样本;再次,搭建深度卷积神经网络,利用已知绑定残基的蛋白质序列构建数据集,并将数据集划分为M组数据子集,利用这十组数据子集训练出M个网络模型;最后,将待进行预测的蛋白质序列处理成残基样本,并输入到被训练过的M个网络模型中,综合这M个模型的预测结果,预测蛋白质序列中的残基是否为绑定残基。本发明计算代价小、预测精度高。
A method for predicting DNA binding residues based on a deep convolutional neural network. First, according to the input residue number of L, the protein sequence information of the ligand binding residues to be predicted is obtained by using the psi-blast program and the PSSpred program. matrices PSSM and PSS; then, combine the two matrices into a feature matrix F; secondly, we process protein sequences into residue samples; thirdly, build a deep convolutional neural network using protein sequences with known binding residues to construct The data set is divided into M groups of data subsets, and M network models are trained by using these ten groups of data subsets; finally, the protein sequences to be predicted are processed into residue samples and input to the trained In the M network models of , the prediction results of these M models are combined to predict whether the residues in the protein sequence are binding residues. The invention has low calculation cost and high prediction accuracy.
Description
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010533489.4A CN111785321B (en) | 2020-06-12 | 2020-06-12 | DNA binding residue prediction method based on deep convolutional neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010533489.4A CN111785321B (en) | 2020-06-12 | 2020-06-12 | DNA binding residue prediction method based on deep convolutional neural network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111785321A CN111785321A (en) | 2020-10-16 |
CN111785321B true CN111785321B (en) | 2022-04-05 |
Family
ID=72756179
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010533489.4A Active CN111785321B (en) | 2020-06-12 | 2020-06-12 | DNA binding residue prediction method based on deep convolutional neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111785321B (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112466392B (en) * | 2020-11-12 | 2024-03-22 | 浙江工业大学 | ATP binding residue prediction method based on deep convolutional network |
CN112365921B (en) * | 2020-11-17 | 2022-07-15 | 浙江工业大学 | A protein secondary structure prediction method based on long and short-term memory network |
CN113257342B (en) * | 2021-04-09 | 2024-05-07 | 浙江工业大学 | Protein interaction site prediction method based on residue position characteristics |
CN113096733B (en) * | 2021-05-11 | 2022-09-30 | 同济大学 | Die body mining method based on sequence and shape information deep fusion |
CN113851192B (en) * | 2021-09-15 | 2023-06-30 | 安庆师范大学 | Training method and device for amino acid one-dimensional attribute prediction model and attribute prediction method |
CN114512188B (en) * | 2022-03-20 | 2024-04-05 | 湖南大学 | DNA binding protein recognition method based on improved protein sequence position specificity matrix |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104992079A (en) * | 2015-06-29 | 2015-10-21 | 南京理工大学 | Sampling learning based protein-ligand binding site prediction method |
WO2018175986A1 (en) * | 2017-03-23 | 2018-09-27 | Rutgers, The State University Of New Jersey | Systems and methods for modeling a protein parameter for understanding protein interactions and generating an energy map |
CN111063389A (en) * | 2019-12-04 | 2020-04-24 | 浙江工业大学 | A Ligand Binding Residue Prediction Method Based on Deep Convolutional Neural Networks |
CN111081311A (en) * | 2019-12-26 | 2020-04-28 | 青岛科技大学 | Prediction of protein lysine malonylation sites based on deep learning |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2820048B1 (en) * | 2012-02-29 | 2019-04-17 | Gilead Biologics, Inc. | Antibodies to matrix metalloproteinase 9 |
CN107478754A (en) * | 2016-06-07 | 2017-12-15 | 复旦大学 | A kind of pre-treating method for detecting Residues in Milk aminoglycoside antibiotics |
CN110689920B (en) * | 2019-09-18 | 2022-02-11 | 上海交通大学 | A protein-ligand binding site prediction method based on deep learning |
-
2020
- 2020-06-12 CN CN202010533489.4A patent/CN111785321B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104992079A (en) * | 2015-06-29 | 2015-10-21 | 南京理工大学 | Sampling learning based protein-ligand binding site prediction method |
WO2018175986A1 (en) * | 2017-03-23 | 2018-09-27 | Rutgers, The State University Of New Jersey | Systems and methods for modeling a protein parameter for understanding protein interactions and generating an energy map |
CN111063389A (en) * | 2019-12-04 | 2020-04-24 | 浙江工业大学 | A Ligand Binding Residue Prediction Method Based on Deep Convolutional Neural Networks |
CN111081311A (en) * | 2019-12-26 | 2020-04-28 | 青岛科技大学 | Prediction of protein lysine malonylation sites based on deep learning |
Also Published As
Publication number | Publication date |
---|---|
CN111785321A (en) | 2020-10-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111785321B (en) | DNA binding residue prediction method based on deep convolutional neural network | |
CN116417093A (en) | A Drug-Target Interaction Prediction Method Combining Transformer and Graph Neural Network | |
Zheng et al. | Emerging deep learning methods for single-cell RNA-seq data analysis | |
CN111063389A (en) | A Ligand Binding Residue Prediction Method Based on Deep Convolutional Neural Networks | |
CN114783526B (en) | Deep unsupervised single-cell clustering method based on Gaussian mixture graph variational autoencoder | |
CN116386729A (en) | scRNA-seq data dimension reduction method based on graph neural network | |
CN114091603A (en) | A spatial transcriptome cell clustering and analysis method | |
CN112951328B (en) | miRNA-gene relationship prediction method and system based on deep learning heterogeneous information network | |
Zaki et al. | Identifying protein complexes in protein-protein interaction data using graph convolutional network | |
Suo et al. | Application of clustering analysis in brain gene data based on deep learning | |
Chen et al. | Learning and interpreting the gene regulatory grammar in a deep learning framework | |
Li et al. | Attention-based deep clustering method for scRNA-seq cell type identification | |
Jiang et al. | Protein-protein interaction sites prediction using batch normalization based CNNs and oversampling method borderline-SMOTE | |
CN112149885B (en) | Ligand binding residue prediction method based on sequence template | |
Termritthikun et al. | Evolutionary neural architecture search based on efficient CNN models population for image classification | |
CN114512188B (en) | DNA binding protein recognition method based on improved protein sequence position specificity matrix | |
Cai et al. | Realize generative yet complete latent representation for incomplete multi-view learning | |
Ko et al. | A deep learning adversarial autoencoder with dynamic batching displays high performance in denoising and ordering scRNA-seq data | |
CN112071362B (en) | Method for detecting protein complex fusing global and local topological structures | |
CN117409968B (en) | Hierarchical attention-based cancer dynamic survival analysis method and system | |
Bilen et al. | A new hybrid and ensemble gene selection approach with an enhanced genetic algorithm for classification of microarray gene expression values on leukemia cancer | |
Yang et al. | RNDEtree: regulatory network with differential equation based on flexible neural tree with novel criterion function | |
CN115862767A (en) | anti-LRRK 2 small molecule drug prediction and screening method based on graph learning | |
CN116343908A (en) | Method, medium and device for predicting protein coding region by fusing DNA shape characteristics | |
John et al. | CNN-BLSTM based deep learning framework for eukaryotic kinome classification: An explainability based approach |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20221109 Address after: D1101, Building 4, Software Industry Base, No. 19, 17, 18, Haitian 1st Road, Binhai Community, Yuehai Street, Nanshan District, Shenzhen, Guangdong, 518000 Patentee after: Shenzhen Xinrui Gene Technology Co.,Ltd. Address before: N2248, Floor 3, Xingguang Yingjing, No. 117, Shuiyin Road, Yuexiu District, Guangzhou, Guangdong 510,000 Patentee before: GUANGZHOU ZHAOJI BIOTECHNOLOGY CO.,LTD. Effective date of registration: 20221109 Address after: N2248, Floor 3, Xingguang Yingjing, No. 117, Shuiyin Road, Yuexiu District, Guangzhou, Guangdong 510,000 Patentee after: GUANGZHOU ZHAOJI BIOTECHNOLOGY CO.,LTD. Address before: 310014 No. 18 Chao Wang Road, Xiacheng District, Zhejiang, Hangzhou Patentee before: JIANG University OF TECHNOLOGY |
|
TR01 | Transfer of patent right |