CN105808972A - Method for predicting protein structure from local to global on basis of knowledge spectrum - Google Patents

Method for predicting protein structure from local to global on basis of knowledge spectrum Download PDF

Info

Publication number
CN105808972A
CN105808972A CN201610139514.4A CN201610139514A CN105808972A CN 105808972 A CN105808972 A CN 105808972A CN 201610139514 A CN201610139514 A CN 201610139514A CN 105808972 A CN105808972 A CN 105808972A
Authority
CN
China
Prior art keywords
distance
sequence
protein
spectrum
search sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610139514.4A
Other languages
Chinese (zh)
Inventor
张贵军
俞旭锋
周晓根
郝小虎
王柳静
李章维
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN201610139514.4A priority Critical patent/CN105808972A/en
Publication of CN105808972A publication Critical patent/CN105808972A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment

Landscapes

  • Spectroscopy & Molecular Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Theoretical Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The invention discloses a method for predicting a protein structure from local to global on the basis of a knowledge spectrum. The method comprises the following steps: for a query sequence, obtaining a high-quality fragment library through a multi-feature seamless threading method, and obtaining distance spectrum knowledge between residues through statistic consistency analysis on the basis of the fragment library; dividing the query sequence into several segments of structures according to residue information recorded in a distance spectrum; carrying out fragment assembling on each segment of structure to obtain relatively low energy and to ensure that the space distance between adjacent residues approaches to a predicted distance in the distance spectrum; carrying out fragment assembling on the unsegmented structures and calculating the global energy to obtain a metastable-state conformation with low energy and more reasonable structure. The method disclosed in the invention is relatively good in conformation space sampling ability and high in prediction precision.

Description

A kind of based on composing knowledge from local to the Advances in protein structure prediction of the overall situation
Technical field
The present invention relates to bioinformatics, computer application field, in particular a kind of based on spectrum knowledge from Local is to the Advances in protein structure prediction of the overall situation.
Background technology
Protein molecule plays vital effect in biological cell chemical reaction process.Their structure mould Type and biological activity state are to we have appreciated that and cure multiple disease have important meaning.Protein is only folded into Specific three dimensional structure could produce its distinctive biological function.It is therefore to be understood that the function of protein, just Its three-D space structure must be obtained.
Tertiary protein structure prediction is a vital task of bioinformatics.Protein conformation optimization problem is existing It is that extremely complex protein energy function surface is scanned for facing maximum challenge.Protein energy mould Type considers molecular system bonding effect and Van der Waals force, electrostatic, hydrogen bond, the non-bonding effect such as hydrophobic, causes The Energy Surface formed it into is the most coarse, and conformation correspondence local minimizers number number is finger with the increase of sequence length Number increases.And the mechanism that protein conformation prediction algorithm can find protein stabilization structure is, substantial amounts of albumen Matter meta structure constitutes low energy area, thus can find protein the overall situation rock-steady structure it is crucial that Algorithm can find substantial amounts of protein meta structure, i.e. increases the population diversity of algorithm.Therefore, for More accurate protein force field model, chooses effective conformational space optimized algorithm, makes new protein structure Prediction algorithm has more universality and high efficiency becomes the focal issue of protein structure prediction in bioinformatics.
At present, Advances in protein structure prediction substantially can be divided into two classes, method based on template and be not based on mould The method of plate.Wherein, ab initio prediction (Ab-inito) method being not based on template is most widely used.It is suitable for In the homology most protein less than 25%, only produce brand new from sequence, to Protein Molecular Design And the research of protein folding etc. is significant.Currently there is following several comparison successful ab initio prediction method: Zhang Yangyu Jeffrey Skolnick cooperation TASSER (Threading/Assembly/Refinement) method, The FeLTr method etc. of the designs such as David Baker and the Rosetta method of team's design, Shehu.But up till now Also not having a kind of the most perfect method to carry out the three dimensional structure of predicted protein matter, well predicting knot even if obtaining Really, but also just for some protein for, technical bottleneck currently mainly is two aspects, first Aspect is the method for sampling, and prior art is not strong to conformational space ability in sampling, further aspect is that conformation updates Method, prior art is the most not enough to the renewal precision of conformation.
Therefore, existing Advances in protein structure prediction Shortcomings, need to improve.
Summary of the invention
In order to overcome that the conformational space ability in sampling of existing Advances in protein structure prediction is strong, precision of prediction relatively Low deficiency, the present invention proposes that a kind of conformational space ability in sampling is preferable, precision of prediction is high based on spectrum knowledge from Local is to the Advances in protein structure prediction of the overall situation.
The technical solution adopted for the present invention to solve the technical problems is:
A kind of based on spectrum knowledge from local to the overall situation Advances in protein structure prediction, described optimization method include with Lower step:
1) given search sequence information;
2) from Protein Data Bank (PDB) website to be, download resolution less thanHigh accuracy protein, wherein For parasang,Rice, removes sequence similarity according to sequence alignment algorithms NW-Align and is more than The amino acid chain of 30%, obtains non-redundant proteins template base;
3) according to multi-feature similarity function:
By the protein chain in seamless threading method comparison nonredundancy template base relative to search sequence each residue position (i, j), wherein i is search sequence resi-dues to score f put, and j is fragment structure;F (i, j) in, under Mark q represents search sequence feature score item, and subscript t represents template protein feature score item, Pq(i,k) The sequence frequency obtained by PSI-BLAST for search sequence is composed, and wherein k is predetermined number amino acids Type;Lq(i, k) and Lt(j k) is the search sequence obtained by PSI-BLAST and template sequence logarithmic spectrum; sstJ () is the classification of template protein secondary structure, DSSP be calculated;ssqI () is search sequence two grades Textural classification, can be obtained by two layers of neural metwork training;sat(j) and saqI () is formwork structure and search sequence Solvent accessibility index, is trained by EDTSurf and neural network procedure and obtains;ψqJ () is inquiry Sequence dihedral angle is to being obtained by two layers of neural metwork training;ψtJ () passes through query protein word Allusion quotation obtains;SPt(j k) is the structure spectrum of template protein;w1、w2、w3、w4And w5For weighted value;
4) according to similarity score f, (i j) chooses M fragment of highest scoring on each position of search sequence and obtains fragment Library file;
5) statistical query sequence is to the distance between coming from a template segments, and the most only statistics is less than's Residue to spacing, draw rectangular histogram and obtain distance spectrum, the distance of rectangular histogram abscissa is spaced apart When in template residue between distance in certain interval, then this interval sum just adds 1, if broken line graph existsThere is peak value in certain interior distance interval, then the distance interval that this peak value is corresponding is in target sequence Residue i, to the Prediction distance of residue j, records the distance spectrum (profile) that this distribution is between two residues;
6) according to the position of residue in gained distance spectrum, search sequence is divided into n section;
7) l=1 is made, l ∈ 1,2,3 ..., and n}, operation following to segmental structure execution:
7.1) l section fragment structure is carried out fragment assembling;
7.2) calculate the distance between the resi-dues wherein containing distance spectrum information, and seek deviation with Prediction distance, Cumulative deviation value is also averaged and is designated as Δ D;
7.3) if Δ D < R, then this conformation storage being designated as cell, wherein R is structural precision constraints;
7.4) 7.1 are repeated) to 7.3) until storage cell number reaches 100 cell, based on Rosetta Score 3 The energy of segmental structure in comparison cell, the structure choosing minimum energy is the prediction knot of this segmentation Structure;
7.5) l=l+1, it is judged that whether l, more than or equal to n, is to enter 8), otherwise return to 7.1);
8) arranging iterations is G, makes s=1, the following operation of execution:
8.1) target induced conformational ENERGY E (P is calculatedtarget);
8.2) unsegmented structure is carried out fragment assembling, calculate energy value E (Ptrail);
8.3) if E is (Ptarget)>E(Ptrail) then use PtrailReplace Ptarget
8.4) s=s+1;Judge that s, whether more than or equal to G, is to enter 9), otherwise return to 8.1);
9) output induced conformational, obtains search sequence nearly native state structure.
The technology of the present invention is contemplated that: obtain high-quality firstly, for search sequence by the seamless threading method of multiple features Amount fragment library, obtains the distance spectrum knowledge between residue-residue based on fragment library by statistics consistency analysis;Then, Search sequence is divided into and is divided into a few segment structure according to the residue information of record in distance spectrum;Afterwards, for each section Structure is assembled by fragment and obtains that energy is relatively low and space length between residue-residue approaches in distance spectrum and finds range in advance From;Carry out fragment assembling finally, for unsegmented structure, calculate global energy, obtain that energy is low and structure more For rational metastable state conformation.
The invention have the benefit that conformational space ability in sampling is preferable, precision of prediction is high.Accompanying drawing explanation
Fig. 1 is cycle tests relation schematic diagram of RMSD and energy value during population recruitment.
Fig. 2 is 1ENH algorithm predicts gained protein three-dimensional structure schematic diagram.
Detailed description of the invention
The invention will be further described below in conjunction with the accompanying drawings.
With reference to Fig. 1~Fig. 2, a kind of based on spectrum knowledge from local to the Advances in protein structure prediction of the overall situation, including Following steps:
1) given search sequence information;
2) from Protein Data Bank (PDB) website to be, download resolution less thanHigh accuracy protein, wherein For parasang,Rice, removes sequence similarity according to sequence alignment algorithms NW-Align and is more than The amino acid chain of 30%, obtains non-redundant proteins template base;
3) according to multi-feature similarity function:
By the protein chain in seamless threading method comparison nonredundancy template base relative to search sequence each residue position (i, j), wherein i is search sequence resi-dues to score f put, and j is fragment structure;F (i, j) in, under Mark q represents search sequence feature score item, and subscript t represents template protein feature score item, Pq(i,k) The sequence frequency obtained by PSI-BLAST for search sequence is composed, and wherein k is predetermined number amino acids Type;Lq(i, k) and Lt(j k) is the search sequence obtained by PSI-BLAST and template sequence logarithmic spectrum; sstJ () is the classification of template protein secondary structure, DSSP be calculated;ssqI () is search sequence two grades Textural classification, can be obtained by two layers of neural metwork training;sat(j) and saqI () is formwork structure and search sequence Solvent accessibility index, is trained by EDTSurf and neural network procedure and obtains;ψqJ () is inquiry Sequence dihedral angle is to being obtained by two layers of neural metwork training;ψtJ () passes through query protein word Allusion quotation obtains;SPt(j k) is the structure spectrum of template protein;w1、w2、w3、w4And w5For weighted value;
4) according to similarity score f, (i j) chooses M fragment of highest scoring on each position of search sequence and obtains fragment Library file;
5) statistical query sequence is to the distance between coming from a template segments, and the most only statistics is less than's Residue to spacing, draw rectangular histogram and obtain distance spectrum, the distance of rectangular histogram abscissa is spaced apart When in template residue between distance in certain interval, then this interval sum just adds 1, if broken line graph existsThere is peak value in certain interior distance interval, then the distance interval that this peak value is corresponding is in target sequence Residue i, to the Prediction distance of residue j, records the distance spectrum (profile) that this distribution is between two residues;
6) according to the position of residue in gained distance spectrum, search sequence is divided into n section;
7) l=1 is made, l ∈ 1,2,3 ..., and n}, operation following to segmental structure execution:
7.1) l section fragment structure is carried out fragment assembling;
7.2) calculate the distance between the resi-dues wherein containing distance spectrum information, and seek deviation with Prediction distance, Cumulative deviation value is also averaged and is designated as Δ D;
7.3) if Δ D < R, then this conformation storage being designated as cell, wherein R is structural precision constraints;
7.4) 7.1 are repeated) to 7.3), until storage cell number reaches x cell, compare based on Rosetta Score 3 To the energy of segmental structure in cell, the structure choosing minimum energy is the pre-geodesic structure of this segmentation;
7.5) l=l+1, it is judged that whether l, more than or equal to n, is to enter 8), otherwise return to 7.1);
8) arranging iterations is G, makes s=1, the following operation of execution:
8.1) calculate target induced conformational ENERGY E (Ptarget);
8.2) unsegmented structure is carried out fragment assembling, calculate energy value E (Ptrail);
8.3) if E is (Ptarget)>E(Ptrail) then use PtrailReplace Ptarget
8.4) s=s+1;Judge that s, whether more than or equal to G, is to enter 9), otherwise return to 8.1);
9) output induced conformational, obtains search sequence nearly native state structure.
The present embodiment with protein 1ENH that sequence length is 54 as embodiment, a kind of based on spectrum knowledge from local To the Advances in protein structure prediction of the overall situation, wherein comprise the steps of
1) given search sequence information;
2) from Protein Data Bank (PDB) website to be, download resolution less thanHigh accuracy protein, wherein For parasang,Rice, removes sequence similarity according to sequence alignment algorithms NW-Align and is more than The amino acid chain of 30%, obtains 8619 protein chains and constitutes nonredundancy template base;
3) according to multi-feature similarity function:
By the protein chain in seamless threading method comparison nonredundancy template base relative to search sequence each residue position (i, j), wherein i is search sequence resi-dues to score f put, and j is fragment structure;F (i, j) in, under Mark q represents search sequence feature score item, and subscript t represents template protein feature score item, Pq(i,k) The sequence frequency obtained by PSI-BLAST for search sequence is composed, and wherein k is predetermined number amino acids Type;Lq(i, k) and Lt(j k) is the search sequence obtained by PSI-BLAST and template sequence logarithmic spectrum; sstJ () is the classification of template protein secondary structure, DSSP be calculated;ssqI () is search sequence two grades Textural classification, can be obtained by two layers of neural metwork training;sat(j) and saqI () is formwork structure and search sequence Solvent accessibility index, is trained by EDTSurf and neural network procedure and obtains;ψqJ () is inquiry Sequence dihedral angle is to being obtained by two layers of neural metwork training;ψtJ () passes through query protein word Allusion quotation obtains;SPt(j k) is the structure spectrum of template protein;w1、w2、w3、w4And w5For weighted value;
4) according to similarity score f, (i, 200 fragments j) choosing highest scoring on each position of search sequence obtain sheet Phase library file;
5) statistical query sequence is to the distance between coming from a template segments, and the most only statistics is less than's Residue to spacing, draw rectangular histogram and obtain distance spectrum, the distance of rectangular histogram abscissa is spaced apart When in template residue between distance in certain interval, then this interval sum just adds 1, if broken line graph existsThere is peak value in certain interior distance interval, then the distance interval that this peak value is corresponding is in target sequence Residue i, to the Prediction distance of residue j, records the distance spectrum (profile) that this distribution is between two residues;
6) according to the position of residue in gained distance spectrum, search sequence is divided into n section;
7) l=1 is made, l ∈ 1,2,3 ..., and n}, operation following to segmental structure execution:
7.1) l section fragment structure is carried out fragment assembling;
7.2) calculate the distance between the resi-dues wherein containing distance spectrum information, and seek deviation with Prediction distance, Cumulative deviation value is also averaged and is designated as Δ D;
7.3) if Δ D < R, then this conformation storage being designated as cell, wherein R is structural precision constraints;
7.4) 7.1 are repeated) to 7.3), until storage cell number reaches x cell, compare based on Rosetta Score 3 To the energy of segmental structure in cell, the structure choosing minimum energy is the pre-geodesic structure of this segmentation;
7.5) l=l+1, it is judged that whether l, more than or equal to n, is to enter 8), otherwise return to 7.1);
8) arranging iterations is G=50000, makes s=1, the following operation of execution:
8.1) target induced conformational ENERGY E (P is calculatedtarget);
8.2) unsegmented structure is carried out fragment assembling, calculate energy value E (Ptrail);
8.3) if E is (Ptarget)>E(Ptrail) then use PtrailReplace Ptarget
8.4) s=s+1;Judge that s, whether more than or equal to G=50000, is to enter 9), otherwise return to 8.1);
9) output induced conformational, obtains search sequence nearly native state structure.
With protein 1ENH that sequence length is 54 as embodiment, above method is used to obtain this protein Nearly native state conformation, in conformation assemblage, conformation updates figure as it is shown in figure 1, algorithm predicts gained protein tridimensional is tied Structure is shown as shown in Figure 2.
The excellent results that the embodiment that the present invention is given that described above is shows, it is clear that the present invention is not Be suitable only for above-described embodiment, without departing from essence spirit of the present invention and without departing from involved by flesh and blood of the present invention in It can be done many variations on the premise of appearance to be carried out.

Claims (1)

1. one kind based on spectrum knowledge from local to the overall situation Advances in protein structure prediction, it is characterised in that: described albumen Matter Structure Prediction Methods comprises the following steps:
1) given search sequence information;
2) from Protein Data Bank website to be, download resolution less thanHigh accuracy protein, whereinFor distance Unit,Remove sequence similarity according to sequence alignment algorithms NW-Align and be more than 30% Amino acid chain, obtains non-redundant proteins template base;
3) according to multi-feature similarity function:
By the protein chain in seamless threading method comparison nonredundancy template base relative to search sequence each residue position (i, j), wherein i is search sequence resi-dues to score f put, and j is fragment structure;F (i, j) in, under Mark q represents search sequence feature score item, and subscript t represents template protein feature score item, Pq(i,k) The sequence frequency obtained by PSI-BLAST for search sequence is composed, and wherein k is predetermined number amino acids Type;Lq(i, k) and Lt(j k) is the search sequence obtained by PSI-BLAST and template sequence logarithmic spectrum; sstJ () is the classification of template protein secondary structure, DSSP be calculated;ssqI () is search sequence two grades Textural classification, can be obtained by two layers of neural metwork training;sat(j) and saqI () is formwork structure and search sequence Solvent accessibility index, is trained by EDTSurf and neural network procedure and obtains;ψqJ () is inquiry Sequence dihedral angle is to being obtained by two layers of neural metwork training;ψtJ () passes through query protein word Allusion quotation obtains;SPt(j k) is the structure spectrum of template protein;w1、w2、w3、w4And w5For weighted value;
4) according to similarity score f, (i j) chooses M fragment of highest scoring on each position of search sequence and obtains fragment Library file;
5) statistical query sequence is to the distance between coming from a template segments, and the most only statistics is less than's Residue to spacing, draw rectangular histogram and obtain distance spectrum, the distance of rectangular histogram abscissa is spaced apart When in template residue between distance in certain interval, then this interval sum just adds 1, if broken line graph existsThere is peak value in certain interior distance interval, then the distance interval that this peak value is corresponding is in target sequence Residue i, to the Prediction distance of residue j, records the distance spectrum (profile) that this distribution is between two residues;
6) according to the position of residue in gained distance spectrum, search sequence is divided into n section;
7) l=1 is made, l ∈ 1,2,3 ..., and n}, operation following to segmental structure execution:
7.1) l section fragment structure is carried out fragment assembling;
7.2) calculate the distance between the resi-dues wherein containing distance spectrum information, and seek deviation with Prediction distance, Cumulative deviation value is also averaged and is designated as Δ D;
7.3) if Δ D < R, then this conformation storage being designated as cell, wherein R is structural precision constraints;
7.4) 7.1 are repeated) to 7.3), until storage cell number reaches x cell, compare based on Rosetta Score 3 To the energy of segmental structure in cell, the structure choosing minimum energy is the pre-geodesic structure of this segmentation;
7.5) l=l+1, it is judged that whether l, more than or equal to n, is to enter 8), otherwise return to 7.1);
8) arranging iterations is G, makes s=1, the following operation of execution:
8.1) target induced conformational ENERGY E (P is calculatedtarget);
8.2) unsegmented structure is carried out fragment assembling, calculate energy value E (Ptrail);
8.3) if E is (Ptarget)>E(Ptrail) then use PtrailReplace Ptarget
8.4) s=s+1;Judge that s, whether more than or equal to G, is to enter 9), otherwise return to 8.1);
9) output induced conformational, obtains search sequence nearly native state structure.
CN201610139514.4A 2016-03-11 2016-03-11 Method for predicting protein structure from local to global on basis of knowledge spectrum Pending CN105808972A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610139514.4A CN105808972A (en) 2016-03-11 2016-03-11 Method for predicting protein structure from local to global on basis of knowledge spectrum

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610139514.4A CN105808972A (en) 2016-03-11 2016-03-11 Method for predicting protein structure from local to global on basis of knowledge spectrum

Publications (1)

Publication Number Publication Date
CN105808972A true CN105808972A (en) 2016-07-27

Family

ID=56467142

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610139514.4A Pending CN105808972A (en) 2016-03-11 2016-03-11 Method for predicting protein structure from local to global on basis of knowledge spectrum

Country Status (1)

Country Link
CN (1) CN105808972A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106372456A (en) * 2016-08-26 2017-02-01 浙江工业大学 Deep learning Residue2vec-based protein structure prediction method
CN107180164A (en) * 2017-04-19 2017-09-19 浙江工业大学 A kind of multiple domain protein structure assemble method based on template
CN107273713A (en) * 2017-05-26 2017-10-20 浙江工业大学 A kind of multiple domain protein template searching method based on TM align
CN107609340A (en) * 2017-07-24 2018-01-19 浙江工业大学 A kind of multiple domain albumen distance spectrum construction method
CN109360597A (en) * 2018-08-01 2019-02-19 浙江工业大学 A kind of group's Advances in protein structure prediction based on global and local policy cooperation
CN109378034A (en) * 2018-08-28 2019-02-22 浙江工业大学 A kind of protein prediction method based on range distribution estimation
CN112585685A (en) * 2018-09-21 2021-03-30 渊慧科技有限公司 Machine learning to determine protein structure

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105205348A (en) * 2015-09-22 2015-12-30 浙江工业大学 Method for colony conformation space optimization based on distance constraint selection strategy

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105205348A (en) * 2015-09-22 2015-12-30 浙江工业大学 Method for colony conformation space optimization based on distance constraint selection strategy

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DONG XU AND YANG ZHANG: "Ab initio protein structure assembly using continuous structure fragments and optimized konwledge-based force field", 《PROTEINS: STRUCTURE, FUNCTION, BIOINFORMATICS》 *
DONG XU AND YANG ZHANG: "Toward optimal fragment generations for ab initio protein structure assembly", 《PROTEINS: STRUCTURE, FUNCTION, BIOINFORMATICS》 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106372456B (en) * 2016-08-26 2019-01-22 浙江工业大学 A kind of Advances in protein structure prediction based on deep learning
CN106372456A (en) * 2016-08-26 2017-02-01 浙江工业大学 Deep learning Residue2vec-based protein structure prediction method
CN107180164A (en) * 2017-04-19 2017-09-19 浙江工业大学 A kind of multiple domain protein structure assemble method based on template
CN107273713A (en) * 2017-05-26 2017-10-20 浙江工业大学 A kind of multiple domain protein template searching method based on TM align
CN107273713B (en) * 2017-05-26 2020-06-02 浙江工业大学 Multi-domain protein template searching method based on TM-align
CN107609340B (en) * 2017-07-24 2020-05-05 浙江工业大学 Multi-domain protein distance spectrum construction method
CN107609340A (en) * 2017-07-24 2018-01-19 浙江工业大学 A kind of multiple domain albumen distance spectrum construction method
CN109360597B (en) * 2018-08-01 2020-12-01 浙江工业大学 Group protein structure prediction method based on global and local strategy cooperation
CN109360597A (en) * 2018-08-01 2019-02-19 浙江工业大学 A kind of group's Advances in protein structure prediction based on global and local policy cooperation
CN109378034A (en) * 2018-08-28 2019-02-22 浙江工业大学 A kind of protein prediction method based on range distribution estimation
CN109378034B (en) * 2018-08-28 2021-06-18 浙江工业大学 Protein prediction method based on distance distribution estimation
CN112585685A (en) * 2018-09-21 2021-03-30 渊慧科技有限公司 Machine learning to determine protein structure
CN112585686A (en) * 2018-09-21 2021-03-30 渊慧科技有限公司 Machine learning to determine protein structure

Similar Documents

Publication Publication Date Title
CN105808972A (en) Method for predicting protein structure from local to global on basis of knowledge spectrum
Li et al. POSREG: proteomic signature discovered by simultaneously optimizing its reproducibility and generalizability
Blaabjerg et al. Rapid protein stability prediction using deep learning representations
Ceriotti et al. Demonstrating the transferability and the descriptive power of sketch-map
Eckert et al. Molecular similarity analysis in virtual screening: foundations, limitations and novel approaches
Pierce et al. A combination of rescoring and refinement significantly improves protein docking performance
Viswanath et al. Improving ranking of models for protein complexes with side chain modeling and atomic potentials
Hou et al. SeRenDIP-CE: sequence-based interface prediction for conformational epitopes
Zheng et al. Improving deep learning protein monomer and complex structure prediction using DeepMSA2 with huge metagenomics data
Anishchenko et al. Structural templates for comparative protein docking
CA2942106A1 (en) Aligning and clustering sequence patterns to reveal classificatory functionality of sequences
Kmiecik et al. Towards the high-resolution protein structure prediction. Fast refinement of reduced models with all-atom force field
Zhang et al. Predicting linear B-cell epitopes by using sequence-derived structural and physicochemical features
Oldfield et al. Computational prediction of secondary and supersecondary structures from protein sequences
Dal Palu et al. CLP-based protein fragment assembly
CN105468934A (en) Method for constructing distance model between protein residues based on Bolzmann probability density function
Liu et al. Assessing protein model quality based on deep graph coupled networks using protein language model
Arshad et al. An explainable stacking-based approach for accelerating the prediction of antidiabetic peptides
Shi et al. A review of machine learning-based methods for predicting drug–target interactions
Hassan et al. Classify a protein domain using sigmoid support vector machine
US8024127B2 (en) Local-global alignment for finding 3D similarities in protein structures
Wang et al. A sequence-based computational method for prediction of MoRFs
Geethu et al. Improved 3-D protein structure predictions using deep ResNet model
Kern et al. Predicting interacting residues using long-distance information and novel decoding in hidden markov models
Hashmi et al. Informatics-driven protein-protein docking

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160727

RJ01 Rejection of invention patent application after publication