CN105653892A - Distance spectrum intelligence based normal distribution distance receiving probability model construction method - Google Patents
Distance spectrum intelligence based normal distribution distance receiving probability model construction method Download PDFInfo
- Publication number
- CN105653892A CN105653892A CN201511008767.XA CN201511008767A CN105653892A CN 105653892 A CN105653892 A CN 105653892A CN 201511008767 A CN201511008767 A CN 201511008767A CN 105653892 A CN105653892 A CN 105653892A
- Authority
- CN
- China
- Prior art keywords
- distance
- sequence
- spectrum
- fragment
- residue
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
Landscapes
- Spectroscopy & Molecular Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Crystallography & Structural Chemistry (AREA)
- Biophysics (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Theoretical Computer Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention provides a distance spectrum intelligence based normal distribution distance receiving probability model construction method. The method comprises: firstly, downloading a high resolution protein file with a known structure in a protein database, and removing a sequence whose homology is greater than a preset threshold by comparing a sequence similarity degree to form a non-redundancy template library; next, performing similarity degree comparison on a protein structure and a query sequence in the template library by means of a slide window, and selecting M segments with the highest score at each location of the query sequence to form a segment library file; then, selecting distances from the same segment structure in a segment library at two locations of the query sequence; and finally, according to distance distribution of an inter-residue distance spectrum, extracting a predicted distance and a variance to construct a probability density function of normal distribution, comparing structure similarity of an induced conformation, and receiving a conformation according to a distance receiving probability of normal probability. The method provided by the present invention has a stronger spatial sampling ability and higher update precision.
Description
Technical field
The present invention relates to bioinformatics, computer application field, in particular a kind of normal distribution distance probability of acceptance model building method based on distance spectrum knowledge.
Background technology
Protein molecule plays vital effect in biological cell chemical reaction process. Their structural model and biological activity state are to we have appreciated that and cure multiple disease have important meaning. Protein is only folded into specific three dimensional structure could produce its distinctive biological function. It is therefore to be understood that the function of protein, it is necessary for obtaining its three-D space structure.
Tertiary protein structure prediction is a vital task of bioinformatics. It is that extremely complex protein energy function surface is scanned for that protein conformation optimization problem faces maximum challenge now. Protein energy model considers molecular system and becomes key effect and Van der Waals force, electrostatic, hydrogen bond, the non-one-tenth key effect such as hydrophobic, causes the Energy Surface formed it into extremely coarse, and conformation correspondence local minimizers number number is exponentially increased with the increase of sequence length. And the mechanism that protein conformation prediction algorithm can find protein stabilization structure is, substantial amounts of protein meta structure constitutes low energy area, so can find the protein most rock-steady structure of the overall situation it is crucial that algorithm can find substantial amounts of protein meta structure, namely increase the population diversity of algorithm. Therefore, for more accurate protein force field model, choose effective conformational space optimized algorithm, make new protein structure prediction algorithm have more universality and high efficiency becomes the focal issue of protein structure prediction in bioinformatics.
At present, Advances in protein structure prediction substantially can be divided into two classes, based on method and the method being not based on template of template. Wherein, ab initio prediction (Ab-inito) method being not based on template is most widely used. It is applicable to the homology most protein less than 25%, only produces brand new from sequence, and the research of Protein Molecular Design and protein folding etc. is significant.Currently there is the successful ab initio prediction method of following several comparison: the FeLTr method etc. of the designs such as TASSER (Threading/Assembly/Refinement) method of Zhang Yangyu JeffreySkolnick cooperation, DavidBaker and the Rosetta method of team's design, Shehu. But the three dimensional structure of predicted protein matter is carried out up till now but without a kind of very perfect method, well predict the outcome even if obtaining, but also just for some protein, technical bottleneck currently mainly is in that two aspects, first aspect is in that the method for sampling, prior art is not strong to conformational space ability in sampling, further aspect is that conformation update method, and prior art is still not enough to the renewal precision of conformation.
Therefore, existing conformational space searching method Shortcomings, it is necessary to improve.
Summary of the invention
In order to the spatial sampling overcoming existing conformational space searching method is indifferent, update the deficiency that precision is relatively low, the present invention provides the normal distribution distance probability of acceptance model building method based on distance spectrum knowledge that a kind of spatial sampling ability is relatively strong, renewal precision is higher.
The technical solution adopted for the present invention to solve the technical problems is:
A kind of normal distribution distance probability of acceptance model building method based on distance spectrum knowledge, described model building method comprises the following steps:
1) nonredundancy template base is built:
1.1) from Protein Data Bank (PDB) website download resolution less thanHigh accuracy protein, whereinFor parasang,Rice;
1.2) protein containing a plurality of polypeptide chain is split into strand, and retain the longest chain and other chain comparative sequences similarities, remove the similarity redundancy polypeptide chain more than predetermined threshold value;
1.3) remaining polypeptide chain is sought sequence similarity I between twomn, add up the accumulative similarity of each chainWherein m, n are the sequence number of polypeptide chain, and N is the sum remaining all chains;
1.4) N bar chain is successively decreased arrangement according to accumulative similarity, start to compare removal sequence similarity successively with other chains more than the chain of predetermined threshold value from the chain that accumulative similarity is maximum, obtain non-redundant proteins template base;
2) input inquiry sequence;
3) fragment library is generated:
3.1) build structural similarity function f (i, j), wherein i is search sequence resi-dues, and j is fragment structure;
3.1.1) search sequence by PSI-BLAST comparison predetermined number aminoacid obtain sequence frequency compose to obtain subitem Pq(i, k), wherein i is search sequence resi-dues, and k is predetermined number amino acid classes, and q is search sequence indications;
3.1.2)Lq(i, k) and Lt(j, is k) by the PSI-BLAST search sequence obtained and template sequence logarithmic spectrum;
3.1.3) the secondary structure prediction ss obtaining formwork structure is calculated by PSSpredt;
3.1.4) it is trained obtaining search sequence secondary structure prediction index s s to sequence spectrum by neural network procedureq;
3.1.5) template protein solvent accessibility parameter sa is obtained by EDTSurf calculatingt;
3.1.6) search sequence solvent accessibility index s a is obtained by neural network procedure predictionq;
3.1.7) can predict, by two layers of neural network procedure training sequence spectrum and secondary structure, the dihedral angle obtaining search sequence��q;
3.1.8) barycenter atom dihedral angle can be obtained by query protein dictionary��tFor formwork structure;
3.1.9)SPt(j, k) for the frequency matrix of each residue absolute presupposition quantity residue type in formwork structure;
3.1.10) structural similarity function
Wherein w1, w2, w3, w4, w5For weighted value;
3.2) by gapless threading method with 3 residues for monomeric unit, being mated with search sequence by the fragment structure in nonredundancy template base, according to structural similarity function f, (i, j) gives a mark to fragment structure;
3.3) using a sliding window when search sequence is with template segments structure matching, (i, j), selects front M the fragment of highest scoring on each position and constitutes fragment library the similarity score f of search sequence i position of comparison and jth fragment;
4) distance spectrum is obtained:
4.1) fragment that on traversal queries sequence position, M similarity is higher,It is the fragment on search sequence i-th position, Fl j(l=1 ..., M) it is the fragment on search sequence jth position;
4.2) a is usedikAnd ajlRepresent the fragment structure coming from same formwork structure selected on i and j;
4.3) a is calculatedikAnd ajlDistance d in original template structureij;
4.4) statistical query sequence is to coming from the distance between template segments, here only add up less thanResidue to spacing, draw rectangular histogram and obtain distance spectrum, the distance of rectangular histogram abscissa is spaced apartWhen in template residue between distance in certain interval, then this interval sum just adds 1, if broken line graph existsIn certain distance interval peak value occurs, then the Prediction distance that the distance interval that this peak value is corresponding is in target sequence residue i to residue j;
5) Prediction distance in pth distance spectrum is extractedAnd variances sigmap, induced conformational decoy is built the distance probability of acceptance
Wherein, decoy is induced conformational, and profile represents distance spectrum, and N is that induced conformational predicts the distance spectrum bar number obtained, and p is distance spectrum index, and a and b is the resi-dues index of record in pth bar distance spectrum respectively,For the space length between induced conformational residue a to residue b,For the Prediction distance of pth bar distance spectrum, ��pDistribution variance for pth bar distance spectrum.
The technology of the present invention is contemplated that: a kind of normal distribution distance probability of acceptance model building method based on distance spectrum knowledge, first, download the high-resolution protein file that in Protein Data Bank, structure is known, remove the homology Sequence composition nonredundancy template base more than 30% by comparative sequences similarity; Secondly, by a sliding window, the protein structure in template base and search sequence are carried out similarity-rough set, select front 200 fragments of highest scoring in each position of search sequence and constitute fragment library file; Then choose the distance coming from same template segments structure on two positions of search sequence in fragment library and constitute distance spectrum; Finally according to the range distribution that residue spacing goes against accepted conventions, extract Prediction distance and the variance probability density function to its structure normal distribution, the structural similarity of comparison induced conformational, and accept conformation with the distance probability of acceptance of normal distribution.
The invention have the benefit that
Accompanying drawing explanation
Fig. 1 is the schematic diagram of the distance spectrum in protein 1ENH between the 20th residue and the 29th residue.
Fig. 2 is the schematic diagram of the distance probability of acceptance of the 20th residue and the 29th residue spacing in protein 1ENH.
Detailed description of the invention
Below in conjunction with accompanying drawing, the invention will be further described.
See figures.1.and.2, a kind of normal distribution distance probability of acceptance model building method based on distance spectrum knowledge, comprise the following steps:
1) nonredundancy template base is built:
1.1) from Protein Data Bank (PDB) website download resolution less thanHigh accuracy protein, whereinFor parasang,Rice;
1.2) protein containing a plurality of polypeptide chain is split into strand, and retain the longest chain and other chain comparative sequences similarities, remove the similarity redundancy polypeptide chain more than predetermined threshold value (taking 30%);
1.3) remaining polypeptide chain is sought sequence similarity I between twomn, add up the accumulative similarity of each chainWherein m, n are the sequence number of polypeptide chain, and N is the sum remaining all chains;
1.4) N bar chain is successively decreased arrangement according to accumulative similarity, start to compare removal sequence similarity successively with other chains more than the chain of predetermined threshold value (taking 30%) from the chain that accumulative similarity is maximum, obtain non-redundant proteins template base;
2) input inquiry sequence;
3) fragment library is generated:
3.1) build structural similarity function f (i, j), wherein i is search sequence resi-dues, and j is fragment structure;
3.1.1) search sequence by PSI-BLAST comparison predetermined number (taking 20) aminoacid obtain sequence frequency compose to obtain subitem Pq(i, k), wherein i is search sequence resi-dues, and k is predetermined number (taking 20) amino acid classes, and q is search sequence indications;
3.1.2)Lq(i, k) and Lt(j, is k) by the PSI-BLAST search sequence obtained and template sequence logarithmic spectrum;
3.1.3) the secondary structure prediction ss obtaining formwork structure is calculated by PSSpredt;
3.1.4) it is trained obtaining search sequence secondary structure prediction index s s to sequence spectrum by neural network procedureq;
3.1.5) template protein solvent accessibility parameter sa is obtained by EDTSurf calculatingt;
3.1.6) search sequence solvent accessibility index s a is obtained by neural network procedure predictionq;
3.1.7) can predict, by two layers of neural network procedure training sequence spectrum and secondary structure, the dihedral angle obtaining search sequence��q;
3.1.8) barycenter atom dihedral angle can be obtained by query protein dictionary��tFor formwork structure;
3.1.9)SPt(j, k) for the frequency matrix of each residue absolute presupposition quantity (taking 20) residue type in formwork structure.
3.1.10) structural similarity function
Wherein w1, w2, w3, w4, w5For weighted value;
3.2) by gapless threading method with 3 residues for monomeric unit, being mated with search sequence by the fragment structure in nonredundancy template base, according to structural similarity function f, (i, j) gives a mark to fragment structure;
3.3) using a sliding window when search sequence is with template segments structure matching, (i, j), selects front M the fragment of highest scoring on each position and constitutes fragment library the similarity score f of search sequence i position of comparison and jth fragment;
4) distance spectrum is obtained:
4.1) fragment that on traversal queries sequence position, M similarity is higher,It is the fragment on search sequence i-th position, Fl j(l=1 ..., M) it is the fragment on search sequence jth position;
4.2) a is usedikAnd ajlRepresent the fragment structure coming from same formwork structure selected on i and j;
4.3) a is calculatedikAnd ajlDistance d in original template structureij;
4.4) statistical query sequence is to coming from the distance between template segments, here only add up less thanResidue to spacing (residue to intermolecular forces along with distance increase and reduce), drawing rectangular histogram obtains distance spectrum, and the distance of rectangular histogram abscissa is spaced apartWhen in template residue between distance in certain interval, then this interval sum just adds 1, if broken line graph existsIn certain distance interval peak value occurs, then the Prediction distance that the distance interval that this peak value is corresponding is in target sequence residue i to residue j;
5) Prediction distance in pth distance spectrum is extractedAnd variances sigmap, induced conformational decoy is built the distance probability of acceptance
Wherein, decoy is induced conformational, and profile represents distance spectrum, and N is that induced conformational predicts the distance spectrum bar number obtained, and p is distance spectrum index, and a and b is the resi-dues index of record in pth bar distance spectrum respectively,For the space length between induced conformational residue a to residue b,For the Prediction distance of pth bar distance spectrum, ��pDistribution variance for pth bar distance spectrum.
The present embodiment with sequence length be 54 protein 1ENH for embodiment, a kind of normal distribution distance probability of acceptance model building method based on distance spectrum knowledge, wherein comprise the steps of
1) nonredundancy template base is built:
1.1) from Protein Data Bank (PDB) website download resolution less thanHigh accuracy protein;
1.2) protein containing a plurality of polypeptide chain is split into strand, and retain the longest chain and other chain comparative sequences similarities, remove the similarity redundancy polypeptide chain more than 30%;
1.3) remaining polypeptide chain is sought sequence similarity I between twomn, add up the accumulative similarity of each chainWherein m, n are the sequence number of polypeptide chain, and N is the total N=35627 remaining all chains;
1.4) N bar chain is successively decreased arrangement according to accumulative similarity, start to compare the removal sequence similarity chain more than 30% successively with other chains from the chain that accumulative similarity is maximum, obtain non-redundant proteins template base;
2) input inquiry sequence;
3) fragment library is generated:
3.1) build structural similarity function f (i, j), wherein i is search sequence resi-dues, and j is fragment structure;
3.1.1) search sequence by 20 aminoacid of PSI-BLAST comparison obtain sequence frequency compose to obtain subitem Pq(i, k), wherein i is search sequence resi-dues, and k is 20 amino acid classes, and q is search sequence indications;
3.1.2)Lq(i, k) and Lt(j, is k) by the PSI-BLAST search sequence obtained and template sequence logarithmic spectrum;
3.1.3) the secondary structure prediction ss obtaining formwork structure is calculated by PSSpredt;
3.1.4) it is trained obtaining search sequence secondary structure prediction index s s to sequence spectrum by neural network procedureq;
3.1.5) template protein solvent accessibility parameter sa is obtained by EDTSurf calculatingt;
3.1.6) search sequence solvent accessibility index s a is obtained by neural network procedure predictionq;
3.1.7) can predict, by two layers of neural network procedure training sequence spectrum and secondary structure, the dihedral angle obtaining search sequence��q;
3.1.8) barycenter atom dihedral angle can be obtained by query protein dictionary��tFor formwork structure;
3.1.9)SPt(j, k) for the frequency matrix of relative 20 residue type of each residue in formwork structure.
3.1.10) structural similarity function
Wherein w1=2, w2=6, w3=2.5, w4=12, w5=10 is weighted value;
3.2) by gapless threading method with 3 residues for monomeric unit, being mated with search sequence by the fragment structure in nonredundancy template base, according to structural similarity function f, (i, j) gives a mark to fragment structure;
3.3) sliding window is used when search sequence is with template segments structure matching, similarity score f (the i of search sequence i position of comparison and jth fragment, j), select front 200 fragments of highest scoring on each position and constitute fragment library;
4) distance spectrum is obtained:
4.1) fragment that on traversal queries sequence position, 200 similarities are higher,It is the fragment on search sequence i-th position, Fl j(l=1 ..., 200) it is the fragment on search sequence jth position;
4.2) a is usedikAnd ajlRepresent the fragment structure coming from same formwork structure selected on i and j;
4.3) a is calculatedikAnd ajlDistance d in original template structureij;
4.4) statistical query sequence is to coming from the distance between template segments, here only add up less thanResidue to spacing (residue to intermolecular forces along with distance increase and reduce), drawing rectangular histogram obtains distance spectrum, and the distance of rectangular histogram abscissa is spaced apartWhen in template residue between distance in certain interval, then this interval sum just adds 1, if broken line graph existsIn certain distance interval peak value occurs, then the Prediction distance that the distance interval that this peak value is corresponding is in target sequence residue i to residue j;
5) Prediction distance in pth distance spectrum is extractedAnd variances sigmap, induced conformational decoy is built the distance probability of acceptance
Wherein, decoy is induced conformational, and profile represents distance spectrum, and N is that induced conformational predicts the distance spectrum bar number obtained, and p is distance spectrum index, and a and b is the resi-dues index of record in pth bar distance spectrum respectively,For the space length between induced conformational residue a to residue b,For the Prediction distance of pth bar distance spectrum, ��pDistribution variance for pth bar distance spectrum;
With sequence length be 54 protein 1ENH for embodiment, above method is used to obtain the distance probability of acceptance between the distance spectrum of this protein and residue, residue to spacing spectrogram as it is shown in figure 1,1VII based on distance spectrum knowledge normal distribution distance the probability of acceptance show as shown in Figure 2.
The excellent results that the embodiment that the present invention provides that described above is shows, the obvious present invention is not only suitable for above-described embodiment, it can be done many variations and be carried out under not necessarily departing from essence spirit of the present invention and the premise without departing from content involved by flesh and blood of the present invention.
Claims (1)
1. the normal distribution distance probability of acceptance model building method based on distance spectrum knowledge, it is characterised in that: described model building method comprises the following steps:
1) nonredundancy template base is built:
1.1) from Protein Data Bank website download resolution less thanHigh accuracy protein, whereinFor parasang,
1.2) protein containing a plurality of polypeptide chain is split into strand, and retain the longest chain and other chain comparative sequences similarities, remove the similarity redundancy polypeptide chain more than predetermined threshold value;
1.3) remaining polypeptide chain is sought sequence similarity I between twomn, add up the accumulative similarity of each chainWherein m, n are the sequence number of polypeptide chain, and N is the sum remaining all chains;
1.4) N bar chain is successively decreased arrangement according to accumulative similarity, start to compare removal sequence similarity successively with other chains more than the chain of predetermined threshold value from the chain that accumulative similarity is maximum, obtain non-redundant proteins template base;
2) input inquiry sequence;
3) fragment library is generated:
3.1) build structural similarity function f (i, j), wherein i is search sequence resi-dues, and j is fragment structure;
3.1.1) search sequence by PSI-BLAST comparison predetermined number aminoacid obtain sequence frequency compose to obtain subitem Pq(i, k), wherein i is search sequence resi-dues, and k is predetermined number amino acid classes, and q is search sequence indications;
3.1.2)Lq(i, k) and Lt(j, is k) by the PSI-BLAST search sequence obtained and template sequence logarithmic spectrum;
3.1.3) the secondary structure prediction ss obtaining formwork structure is calculated by PSSpredt;
3.1.4) it is trained obtaining search sequence secondary structure prediction index s s to sequence spectrum by neural network procedureq;
3.1.5) template protein solvent accessibility parameter sa is obtained by EDTSurf calculatingt;
3.1.6) search sequence solvent accessibility index s a is obtained by neural network procedure predictionq;
3.1.7) can predict, by two layers of neural network procedure training sequence spectrum and secondary structure, the dihedral angle obtaining search sequence��q;
3.1.8) barycenter atom dihedral angle can be obtained by query protein dictionary��tFor formwork structure;
3.1.9)SPt(j, k) for the frequency matrix of each residue absolute presupposition quantity residue type in formwork structure;
3.1.10) structural similarity function
Wherein w1, w2, w3, w4, w5For weighted value;
3.2) by gapless threading method with 3 residues for monomeric unit, being mated with search sequence by the fragment structure in nonredundancy template base, according to structural similarity function f, (i, j) gives a mark to fragment structure;
3.3) using a sliding window when search sequence is with template segments structure matching, (i, j), selects front M the fragment of highest scoring on each position and constitutes fragment library the similarity score f of search sequence i position of comparison and jth fragment;
4) distance spectrum is obtained:
4.1) fragment that on traversal queries sequence position, M similarity is higher,It is the fragment on search sequence i-th position,It it is the fragment on search sequence jth position;
4.2) a is usedikAnd ajlRepresent the fragment structure coming from same formwork structure selected on i and j;
4.3) a is calculatedikAnd ajlDistance d in original template structureij;
4.4) statistical query sequence is to coming from the distance between template segments, here only add up less thanResidue to spacing, draw rectangular histogram and obtain distance spectrum, the distance of rectangular histogram abscissa is spaced apartWhen in template residue between distance in certain interval, then this interval sum just adds 1, if broken line graph existsIn certain distance interval peak value occurs, then the Prediction distance that the distance interval that this peak value is corresponding is in target sequence residue i to residue j;
5) Prediction distance in pth distance spectrum is extractedAnd variances sigmap, induced conformational decoy is built the distance probability of acceptance
Wherein, decoy is induced conformational, and profile represents distance spectrum, and N is that induced conformational predicts the distance spectrum bar number obtained, and p is distance spectrum index, and a and b is the resi-dues index of record in pth bar distance spectrum respectively,For the space length between induced conformational residue a to residue b,For the Prediction distance of pth bar distance spectrum, ��pDistribution variance for pth bar distance spectrum.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201511008767.XA CN105653892A (en) | 2015-12-29 | 2015-12-29 | Distance spectrum intelligence based normal distribution distance receiving probability model construction method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201511008767.XA CN105653892A (en) | 2015-12-29 | 2015-12-29 | Distance spectrum intelligence based normal distribution distance receiving probability model construction method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105653892A true CN105653892A (en) | 2016-06-08 |
Family
ID=56477143
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201511008767.XA Pending CN105653892A (en) | 2015-12-29 | 2015-12-29 | Distance spectrum intelligence based normal distribution distance receiving probability model construction method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105653892A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109378034A (en) * | 2018-08-28 | 2019-02-22 | 浙江工业大学 | A kind of protein prediction method based on range distribution estimation |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104951669A (en) * | 2015-06-08 | 2015-09-30 | 浙江工业大学 | Distance spectrum establishing method for protein structure predicting |
-
2015
- 2015-12-29 CN CN201511008767.XA patent/CN105653892A/en active Pending
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104951669A (en) * | 2015-06-08 | 2015-09-30 | 浙江工业大学 | Distance spectrum establishing method for protein structure predicting |
Non-Patent Citations (3)
Title |
---|
DONG XU AND YANG ZHANG: "Ab initio protein structure assembly using continuous structure fragments and optimized konwledge-based force field", 《PROTEINS: STRUCTURE, FUNCTION, BIOINFORMATICS》 * |
DONG XU AND YANG ZHANG: "Toward optimal fragment generations for ab initio protein structure assembly", 《PROTEINS: STRUCTURE, FUNCTION, BIOINFORMATICS》 * |
PREDRAG KUKIC ET AL.: "Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks", 《BMC BIOINFORMATICS》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109378034A (en) * | 2018-08-28 | 2019-02-22 | 浙江工业大学 | A kind of protein prediction method based on range distribution estimation |
CN109378034B (en) * | 2018-08-28 | 2021-06-18 | 浙江工业大学 | Protein prediction method based on distance distribution estimation |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Deng et al. | Protein structure prediction | |
Jisna et al. | Protein structure prediction: conventional and deep learning perspectives | |
Xu et al. | Toward optimal fragment generations for ab initio protein structure assembly | |
Liew et al. | Pattern recognition techniques for the emerging field of bioinformatics: A review | |
Lanchantin et al. | Deep motif: Visualizing genomic sequence classifications | |
Dong et al. | Identification of DNA-binding proteins by auto-cross covariance transformation | |
Zhang et al. | Adaptive compressive learning for prediction of protein–protein interactions from primary sequence | |
CN105468934A (en) | Method for constructing distance model between protein residues based on Bolzmann probability density function | |
Jiang et al. | Explainable deep hypergraph learning modeling the peptide secondary structure prediction | |
CN105808972A (en) | Method for predicting protein structure from local to global on basis of knowledge spectrum | |
CN106372456B (en) | A kind of Advances in protein structure prediction based on deep learning | |
Wang et al. | Improved fragment sampling for ab initio protein structure prediction using deep neural networks | |
Yue et al. | Fast palmprint identification with multiple templates per subject | |
Ali et al. | Target-DBPPred: an intelligent model for prediction of DNA-binding proteins using discrete wavelet transform based compression and light eXtreme gradient boosting | |
Sabarinathan et al. | ProSTRIP: A method to find similar structural repeats in three-dimensional protein structures | |
Lv et al. | AMPpred-EL: An effective antimicrobial peptide prediction model based on ensemble learning | |
Rani et al. | Cluster analysis method for multiple sequence alignment | |
Shen et al. | Detecting protein candidate fragments using a structural alphabet profile comparison approach | |
Ghualm et al. | Identification of pathway-specific protein domain by incorporating hyperparameter optimization based on 2D convolutional neural network | |
CN107563148B (en) | Ion index-based integral protein identification method and system | |
CN105653892A (en) | Distance spectrum intelligence based normal distribution distance receiving probability model construction method | |
Nguyen et al. | Protein interaction hotspot identification using sequence-based frequency-derived features | |
CN1598821A (en) | Seaching method of genome sequence data based on characteristic | |
Geethu et al. | Improved 3-D protein structure predictions using deep ResNet model | |
CN115691666A (en) | Sigma-based mutation pathogenicity prediction analysis method, system and equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20160608 |