CN108804868B - Protein two-stage conformation space optimization method based on dihedral angle entropy value - Google Patents
Protein two-stage conformation space optimization method based on dihedral angle entropy value Download PDFInfo
- Publication number
- CN108804868B CN108804868B CN201810297146.5A CN201810297146A CN108804868B CN 108804868 B CN108804868 B CN 108804868B CN 201810297146 A CN201810297146 A CN 201810297146A CN 108804868 B CN108804868 B CN 108804868B
- Authority
- CN
- China
- Prior art keywords
- dihedral angle
- residue
- population
- dihedral
- stage
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 46
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 46
- 238000000034 method Methods 0.000 title claims abstract description 44
- 238000005457 optimization Methods 0.000 title claims abstract description 13
- DSCFFEYYQKSRSV-KLJZZCKASA-N D-pinitol Chemical compound CO[C@@H]1[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@H]1O DSCFFEYYQKSRSV-KLJZZCKASA-N 0.000 claims description 4
- 230000003094 perturbing effect Effects 0.000 claims description 4
- 230000002411 adverse Effects 0.000 abstract description 2
- 230000000694 effects Effects 0.000 abstract description 2
- 230000006870 function Effects 0.000 description 12
- 238000000455 protein structure prediction Methods 0.000 description 5
- 125000003275 alpha amino acid group Chemical group 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000008827 biological function Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000013421 nuclear magnetic resonance imaging Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
A two-stage protein conformation space optimization method based on dihedral angle entropy values comprises the steps of firstly, initializing a population by utilizing the first stage and the second stage of a Rosetta protocol; in the exploration phase, the third phase of the iterative Rosetta protocol carries out large-range conformation search, and the change of dihedral angle entropy values in each generation of population is used as the standard of phase switching; in the enhancement stage, random local disturbance is carried out on the dihedral angle of the residues in the loop region on the basis of the fourth stage of the Rosetta protocol, dihedral angle energy is introduced to guide conformation updating in combination with the Rosetta score3 energy function, and adverse effects caused by inaccuracy of the energy function are reduced while exploration on the loop region is enhanced. The invention provides a protein two-stage conformation space optimization method based on dihedral angle entropy with higher prediction precision.
Description
Technical Field
The invention relates to the fields of bioinformatics and computer application, in particular to a protein two-stage conformation space optimization method based on dihedral angle entropy.
Background
Protein molecules are important components for composing all cells and tissues of human body. All important components of the body require the involvement of proteins. The protein has abundant functions and plays an important role in the normal operation of the organism. The three-dimensional structure of a protein determines the function of the protein, and the protein can only be correctly folded into a specific three-dimensional structure to generate a specific biological function. Diseases due to protein misfolding are not uncommon. Therefore, it is necessary to obtain a three-dimensional structure of a protein in order to understand the function of the protein and cure various diseases related to the protein.
Since the end of the twentieth century, the field of life science has developed rapidly, and proteins, which are macromolecules with the widest distribution and the most complex functions in organisms, are particularly and widely concerned and researched. Prediction of the three-dimensional structure of proteins is an important task of bioinformatics. Different proteins have different amino acid sequences, and all proteins are folded on the basis of one-dimensional sequences to form specific three-dimensional structures, and the understanding of the three-dimensional structures of the proteins is the basis for researching the biological functions of the proteins. At present, the three-dimensional structure of the protein is mainly obtained by X-ray crystal diffraction and nuclear magnetic resonance imaging technologies, but the two protein structures obtained by experimental methods have different costs and respective application limitations. Therefore, it is important to predict the three-dimensional structure of a protein by simulating the process of folding the protein from an amino acid sequence into a specific spatial structure by a computer technology in combination with bioinformatics. Predicting the tertiary structure of a protein based on sequence information remains a significant challenge. The technology based on the fragment assembly principle is the most advanced method for solving the problem, but the method has insufficient prediction precision for larger and harder proteins.
Rosetta is a relatively successful protein structure prediction method based on fragment assembly at present, but the method also has limitations. The main technical difficulties at present come from two aspects. On one hand, the conformation sampling capability of the existing protein structure prediction method is not strong enough, and the exploration efficiency of a loop area is extremely low; another aspect is that the energy function is not accurate enough and the low energy conformation does not necessarily correspond to the near-native conformation, which makes it impossible to produce a prediction result with high accuracy using only the energy function as a criterion for the conformational update.
Therefore, the current protein structure prediction methods have disadvantages and need to be improved.
Disclosure of Invention
In order to overcome the defects of the conventional protein structure prediction method, the invention provides a two-stage conformation space optimization method with higher prediction precision based on dihedral angle entropy. Firstly, initializing a population by utilizing a first phase and a second phase of a Rosetta protocol; in the exploration phase, the third phase of the iterative Rosetta protocol carries out large-range conformation search, and the change of dihedral angle entropy values in each generation of population is used as the standard of phase switching; in the enhancement stage, random local perturbation is carried out on the dihedral angle of the residues in the loop region on the basis of the fourth stage of the Rosetta protocol, and dihedral angle energy is introduced to guide conformation update in combination with the Rosetta score3 energy function.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a method for two-stage conformational space optimization of a protein based on dihedral entropy, the method comprising the steps of:
1) inputting sequence information and secondary structure information of a predicted protein;
2) setting parameters: population size NP, maximum number of iterations in exploration phaseMaximum number of iterations in enhancement phaseAn entropy convergence threshold α;
3) population initialization: the first and second phases of the Rosetta protocol are iterated to generate an initial population with NP individualsAnd let P be { P ═ P1,P2,...,PNP}=Pinit;
4) Calculating the entropy value of the dihedral angle of the population by the following process:
4.1) dividing the dihedral angle range [ -180 °,180 °) into 36 dihedral angle blocks with 10 ° as a reference;
4.2) statistics of the probability that dihedral angles φ and ψ fall on each block for each residue of all conformations in the populationAndwherein i denotes the residue number, i ∈ {1, 2., l }, l denotes the protein sequence length, j denotes the dihedral corner block, j ∈ {1, 2., 36 }; obtaining the distribution probability of dihedral angles of each residue;
4.3) according to the formulaAndcalculating the entropy value of dihedral angles phi and psi of each residue;
5) the exploration phase comprises the following processes:
5.2) performing Rosetta protocol third-phase assembly on each individual in the population P;
5.3) executing the step 4), and calculating the entropy value of the dihedral angle of the population
5.4) if g1And if the entropy value is more than 20, judging whether the entropy value is converged, wherein the process is as follows:
5.4.1) according to the formulaWherein k belongs to {1, 2.,. 20}, and each entropy change of the current generation and the previous 20 generations is calculated;
5.4.2) according to the formula εmax=max{ε1,ε2,...,ε20Calculating the maximum entropy change;
5.4.3) if εmax<α,Ending the exploration phase, and entering the enhancement phase in the step 6); otherwise, executing step 5.5);
5.5)g1=g1+1;
5.6) ifTurning to step 5.2) to execute the next iteration; otherwise, ending the exploration phase and entering the enhancement phase in the step 6);
6) the enhancement stage comprises the following processes:
6.2) performing Rosetta protocol fourth phase assembly on each individual in the population P;
6.3) locally perturbing each individual of the population P, as follows:
6.3.1) order P*=PnIn which P isnRepresents individuals in the population P, and n belongs to {1, 2.., NP };
6.3.2) randomly selecting a residue position r of a loop region;
6.3.3) generating a random decimal rand1, rand1 belongs to [0,1 ];
6.3.4) if rand1 is less than 0.5, execute step 6.3.5); otherwise, go to step 6.3.6)
6.3.5) Pair of conformations P*The dihedral angle phi of the r-th residue in (b) is locally perturbed as follows:
6.3.5.1) reading conformation P*The value phi of the dihedral angle phi of the r-th residue in (1);
6.3.5.2) generates a random number rand2, rand2 ∈ [ -5 °,5 ° ];
6.3.5.3)phi=phi+rand2;
6.3.5.4) will form a conformation P*Replacing the value of dihedral angle phi for the residue at residue r in (1) with phi;
6.3.6) Pair of conformations P*The dihedral angle ψ of the r-th residue in (a) is locally perturbed as follows:
6.3.6.1) readTaking conformation P*The dihedral angle psi value of the r-th residue psi;
6.3.6.2) generates a random number rand3, rand3 ∈ [ -5 °,5 ° ];
6.3.6.3)psi=psi+rand3;
6.3.6.4) will form a conformation P*The value of dihedral angle ψ of the r-th residue in (a) is replaced with psi;
6.3.7) if P*Dihedral angle of medium perturbation and PnThe corresponding dihedral angles in the block are in the same dihedral angle block, and step 6.3.8) is executed; otherwise, go to step 6.3.9)
6.3.8) computing P with the Rosetta score3 energy function*And PnThe energy of (a); deciding whether to use P according to Metropolis criterion*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.3.9) directed conformational update according to dihedral energy, the procedure is as follows:
6.3.9.1) determining P based on the dihedral distribution probability of the last generation residue in the search stage*Dihedral angle and P of medium perturbationnDistribution probability p of dihedral angle block in which corresponding dihedral angle is located*And pn;
6.3.9.2) according to formula e*=-ln(p*) And en=-ln(pn) Calculating dihedral angle energy;
6.3.9.3) determine whether to use P according to Metropolis criteria*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.4)g2=g2+1;
6.5) ifSkipping to step 6.2) to execute the next iteration; otherwise, ending the enhancement stage; 7) and outputting a prediction result according to a Rosetta protocol.
The invention has the beneficial effects that: guiding conformation search by counting entropy value information of two-sided angles of conformation population residues in the prediction process, and realizing exploration and enhancement switching according to the change of entropy values; randomly selecting a residue dihedral angle of a loop region in the conformation to carry out local disturbance, selecting a Rosetta score3 energy function or dihedral angle energy to guide conformation update according to the distribution of the dihedral angle, and reducing adverse effects brought by inaccurate energy function while enhancing the search for the loop region; thereby improving the accuracy of protein structure prediction.
Drawings
FIG. 1 is a schematic diagram of conformational update in the structural prediction of protein 1ctf by a protein two-stage conformational space optimization method based on dihedral entropy.
FIG. 2 is a three-dimensional structure diagram of protein 1ctf obtained by structure prediction based on dihedral entropy two-stage conformational space optimization method.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
Referring to fig. 1 and2, a two-stage conformation space optimization method of protein based on dihedral angle entropy includes the following steps:
1) inputting sequence information and secondary structure information of a predicted protein;
2) setting parameters: population size NP, maximum number of iterations in exploration phaseMaximum number of iterations in enhancement phaseAn entropy convergence threshold α;
3) population initialization: the first and second phases of the Rosetta protocol are iterated to generate an initial population with NP individualsAnd let P be { P ═ P1,P2,...,PNP}=Pinit;
4) Calculating the entropy value of the dihedral angle of the population by the following process:
4.1) dividing the dihedral angle range [ -180 °,180 °) into 36 dihedral angle blocks with 10 ° as a reference;
4.2) statistics of the probability that dihedral angles φ and ψ fall on each block for each residue of all conformations in the populationAndwherein i denotes the residue number, i ∈ {1, 2., l }, l denotes the protein sequence length, j denotes the dihedral corner block, j ∈ {1, 2., 36 }; obtaining the distribution probability of dihedral angles of each residue;
4.3) according to the formulaAndcalculating the entropy value of dihedral angles phi and psi of each residue;
5) the exploration phase comprises the following processes:
5.2) performing Rosetta protocol third-phase assembly on each individual in the population P;
5.3) executing the step 4), and calculating the entropy value of the dihedral angle of the population
5.4) if g1And if the entropy value is more than 20, judging whether the entropy value is converged, wherein the process is as follows:
5.4.1) according to the formulaWherein k belongs to {1, 2.,. 20}, and each entropy change of the current generation and the previous 20 generations is calculated;
5.4.2) according to the formula εmax=max{ε1,ε2,...,ε20Calculating the maximum entropy change;
5.4.3) if εmaxIf the alpha is less than alpha, ending the exploration stage and entering the step 6) enhancement stage; otherwise, executing step 5.5);
5.5)g1=g1+1;
5.6) ifTurning to step 5.2) to execute the next iteration; otherwise, ending the exploration phase and entering the enhancement phase in the step 6);
6) the enhancement stage comprises the following processes:
6.2) performing Rosetta protocol fourth phase assembly on each individual in the population P;
6.3) locally perturbing each individual of the population P, as follows:
6.3.1) order P*=PnIn which P isnRepresents individuals in the population P, and n belongs to {1, 2.., NP };
6.3.2) randomly selecting a residue position r of a loop region;
6.3.3) generating a random decimal rand1, rand1 belongs to [0,1 ];
6.3.4) if rand1 is less than 0.5, execute step 6.3.5); otherwise, go to step 6.3.6)
6.3.5) Pair of conformations P*The dihedral angle phi of the r-th residue in (b) is locally perturbed as follows:
6.3.5.1) reading conformation P*The value phi of the dihedral angle phi of the r-th residue in (1);
6.3.5.2) generates a random number rand2, rand2 ∈ [ -5 °,5 ° ];
6.3.5.3)phi=phi+rand2;
6.3.5.4) will form a conformation P*Replacing the value of dihedral angle phi for the residue at residue r in (1) with phi;
6.3.6) Pair of conformations P*The dihedral angle ψ of the r-th residue in (a) is locally perturbed as follows:
6.3.6.1) reading conformation P*The dihedral angle psi value of the r-th residue psi;
6.3.6.2) generates a random number rand3, rand3 ∈ [ -5 °,5 ° ];
6.3.6.3)psi=psi+rand3;
6.3.6.4) will form a conformation P*The value of dihedral angle ψ of the r-th residue in (a) is replaced with psi;
6.3.7) if P*Dihedral angle of medium perturbation and PnThe corresponding dihedral angles in the block are in the same dihedral angle block, and step 6.3.8) is executed; otherwise, go to step 6.3.9)
6.3.8) computing P with the Rosetta score3 energy function*And PnThe energy of (a); deciding whether to use P according to Metropolis criterion*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.3.9) directed conformational update according to dihedral energy, the procedure is as follows:
6.3.9.1) determining P based on the dihedral distribution probability of the last generation residue in the search stage*Dihedral angle and P of medium perturbationnDistribution probability p of dihedral angle block in which corresponding dihedral angle is located*And pn;
6.3.9.2) according to formula e*=-ln(p*) And en=-ln(pn) Calculating dihedral angle energy;
6.3.9.3) determine whether to use P according to Metropolis criteria*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.4)g2=g2+1;
6.5) ifJump toTo step 6.2) executing next iteration; otherwise, ending the enhancement stage; 7) and outputting a prediction result according to a Rosetta protocol.
In this embodiment, taking protein 1ctf with a sequence length of 68 as an example, a two-stage conformational space optimization method of protein based on dihedral entropy comprises the following steps:
1) inputting sequence information and secondary structure information of protein 1 ctf;
2) setting parameters: population size NP is 100, maximum number of iterations in exploration phaseMaximum number of iterations in enhancement phaseThe entropy convergence threshold α is 0.005;
3) population initialization: the first and second phases of the Rosetta protocol are iterated to generate an initial population with NP individualsAnd let P be { P ═ P1,P2,...,PNP}=Pinit;
4) Calculating the entropy value of the dihedral angle of the population by the following process:
4.1) dividing the dihedral angle range [ -180 °,180 °) into 36 dihedral angle blocks with 10 ° as a reference;
4.2) statistics of the probability that dihedral angles φ and ψ fall on each block for each residue of all conformations in the populationAndwherein i denotes the residue number, i ∈ {1, 2., l }, l denotes the protein sequence length, j denotes the dihedral corner block, j ∈ {1, 2., 36 }; obtaining the distribution probability of dihedral angles of each residue;
4.3) according to the formulaAndcalculating the entropy value of dihedral angles phi and psi of each residue;
5) the exploration phase comprises the following processes:
5.2) performing Rosetta protocol third-phase assembly on each individual in the population P;
5.3) executing the step 4), and calculating the entropy value of the dihedral angle of the population
5.4) if g1And if the entropy value is more than 20, judging whether the entropy value is converged, wherein the process is as follows:
5.4.1) according to the formulaWherein k belongs to {1, 2.,. 20}, and each entropy change of the current generation and the previous 20 generations is calculated;
5.4.2) according to the formula εmax=max{ε1,ε2,...,ε20Calculating the maximum entropy change;
5.4.3) if εmaxIf the alpha is less than alpha, ending the exploration stage and entering the step 6) enhancement stage; otherwise, executing step 5.5);
5.5)g1=g1+1;
5.6) ifGo to the step5.2) executing the next iteration; otherwise, ending the exploration phase and entering the enhancement phase in the step 6);
6) the enhancement stage comprises the following processes:
6.2) performing Rosetta protocol fourth phase assembly on each individual in the population P;
6.3) locally perturbing each individual of the population P, as follows:
6.3.1) order P*=PnIn which P isnRepresents individuals in the population P, and n belongs to {1, 2.., NP };
6.3.2) randomly selecting a residue position r of a loop region;
6.3.3) generating a random decimal rand1, rand1 belongs to [0,1 ];
6.3.4) if rand1 is less than 0.5, execute step 6.3.5); otherwise, go to step 6.3.6)
6.3.5) Pair of conformations P*The dihedral angle phi of the r-th residue in (b) is locally perturbed as follows:
6.3.5.1) reading conformation P*The value phi of the dihedral angle phi of the r-th residue in (1);
6.3.5.2) generates a random number rand2, rand2 ∈ [ -5 °,5 ° ];
6.3.5.3)phi=phi+rand2;
6.3.5.4) will form a conformation P*Replacing the value of dihedral angle phi for the residue at residue r in (1) with phi;
6.3.6) Pair of conformations P*The dihedral angle ψ of the r-th residue in (a) is locally perturbed as follows:
6.3.6.1) reading conformation P*The dihedral angle psi value of the r-th residue psi;
6.3.6.2) generates a random number rand3, rand3 ∈ [ -5 °,5 ° ];
6.3.6.3)psi=psi+rand3;
6.3.6.4) will form a conformation P*Substitution of the value of dihedral angle ψ for the r-th residue in (1)Is psi;
6.3.7) if P*Dihedral angle of medium perturbation and PnThe corresponding dihedral angles in the block are in the same dihedral angle block, and step 6.3.8) is executed; otherwise, go to step 6.3.9)
6.3.8) computing P with the Rosetta score3 energy function*And PnThe energy of (a); deciding whether to use P according to Metropolis criterion*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.3.9) directed conformational update according to dihedral energy, the procedure is as follows:
6.3.9.1) determining P based on the dihedral distribution probability of the last generation residue in the search stage*Dihedral angle and P of medium perturbationnDistribution probability p of dihedral angle block in which corresponding dihedral angle is located*And pn;
6.3.9.2) according to formula e*=-ln(p*) And en=-ln(pn) Calculating dihedral angle energy;
6.3.9.3) determine whether to use P according to Metropolis criteria*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.4)g2=g2+1;
6.5) ifSkipping to step 6.2) to execute the next iteration; otherwise, ending the enhancement stage; 7) and outputting a prediction result according to a Rosetta protocol.
Taking the protein 1ctf with the amino acid sequence length of 68 as an example, the near-native conformation of the protein is obtained by the method, and the conformation renewal scheme is shown in FIG. 1; minimum root mean square deviation ofThe prediction structure is shown in fig. 2.
The foregoing is a predictive description of the invention as embodied in one embodiment, and it will be apparent that the invention is not limited to the embodiment described above, but may be embodied with various modifications without departing from the basic inventive concept and without departing from the spirit thereof.
Claims (1)
1. A protein two-stage conformation space optimization method based on dihedral angle entropy is characterized in that: the two-stage conformation space optimization method of the protein comprises the following steps:
1) inputting sequence information and secondary structure information of a predicted protein;
2) setting parameters: population size NP, maximum number of iterations in exploration phaseMaximum number of iterations in enhancement phaseAn entropy convergence threshold α;
3) population initialization: the first and second phases of the Rosetta protocol are iterated to generate an initial population with NP individualsAnd let P be { P ═ P1,P2,...,PNP}=Pinit;
4) Calculating the entropy value of the dihedral angle of the population by the following process:
4.1) dividing the dihedral angle range [ -180 °,180 °) into 36 dihedral angle blocks with 10 ° as a reference;
4.2) statistics of the probability that dihedral angles φ and ψ fall on each block for each residue of all conformations in the populationAndwherein i denotes the residue number, i ∈ {1, 2., l }, l denotes the protein sequence length, j denotes the dihedral corner block, j ∈ {1, 2., 36 }; obtaining the distribution probability of dihedral angle of each residue;
4.3) according to the formulaAndcalculating the entropy value of dihedral angles phi and psi of each residue;
5) the exploration phase comprises the following processes:
5.2) performing Rosetta protocol third-phase assembly on each individual in the population P;
5.3) executing the step 4), and calculating the entropy value of the dihedral angle of the population
5.4) if g1And if the entropy value is more than 20, judging whether the entropy value is converged, wherein the process is as follows:
5.4.1) according to the formulaWherein k belongs to {1, 2.,. 20}, and each entropy change of the current generation and the previous 20 generations is calculated;
5.4.2) according to the formula εmax=max{ε1,ε2,...,ε20Calculating the maximum entropy change;
5.4.3) if εmaxIf the alpha is less than alpha, ending the exploration stage and entering the step 6) enhancement stage; otherwise, executing step 5.5);
5.5)g1=g1+1;
5.6) ifTurning to step 5.2) to execute the next iteration; otherwise, ending the exploration phase and entering the enhancement phase in the step 6);
6) the enhancement stage comprises the following processes:
6.2) performing Rosetta protocol fourth phase assembly on each individual in the population P;
6.3) locally perturbing each individual of the population P, as follows:
6.3.1) order P*=PnIn which P isnRepresents individuals in the population P, and n belongs to {1, 2.., NP };
6.3.2) randomly selecting a residue position r of a loop region;
6.3.3) generating a random decimal rand1, rand1 belongs to [0,1 ];
6.3.4) if rand1 is less than 0.5, execute step 6.3.5); otherwise, go to step 6.3.6)
6.3.5) Pair of conformations P*The dihedral angle phi of the r-th residue in (b) is locally perturbed as follows:
6.3.5.1) reading conformation P*The value phi of the dihedral angle phi of the r-th residue in (1);
6.3.5.2) generates a random number rand2, rand2 ∈ [ -5 °,5 ° ];
6.3.5.3)phi=phi+rand2;
6.3.5.4) will form a conformation P*Replacing the value of dihedral angle phi for the residue at residue r in (1) with phi;
6.3.6) Pair of conformations P*The dihedral angle ψ of the r-th residue in (a) is locally perturbed as follows:
6.3.6.1) reading conformation P*The dihedral angle psi value of the r-th residue psi;
6.3.6.2) generates a random number rand3, rand3 ∈ [ -5 °,5 ° ];
6.3.6.3)psi=psi+rand3;
6.3.6.4) will form a conformation P*The value of dihedral angle ψ of the r-th residue in (a) is replaced with psi;
6.3.7) if P*Dihedral angle of medium perturbation and PnThe corresponding dihedral angles in the block are in the same dihedral angle block, and step 6.3.8) is executed; otherwise, go to step 6.3.9)
6.3.8) computing P with the Rosetta score3 energy function*And PnThe energy of (a); deciding whether to use P according to Metropolis criterion*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.3.9) directed conformational update according to dihedral energy, the procedure is as follows:
6.3.9.1) determining P based on the dihedral distribution probability of the last generation residue in the search stage*Dihedral angle and P of medium perturbationnDistribution probability p of dihedral angle block in which corresponding dihedral angle is located*And pn;
6.3.9.2) according to formula e*=-ln(p*) And en=-ln(pn) Calculating dihedral angle energy;
6.3.9.3) determine whether to use P according to Metropolis criteria*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.4)g2=g2+1;
6.5) ifSkipping to step 6.2) to execute the next iteration; otherwise, the enhancement phase is ended
7) And outputting a prediction result according to a Rosetta protocol.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810297146.5A CN108804868B (en) | 2018-03-30 | 2018-03-30 | Protein two-stage conformation space optimization method based on dihedral angle entropy value |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810297146.5A CN108804868B (en) | 2018-03-30 | 2018-03-30 | Protein two-stage conformation space optimization method based on dihedral angle entropy value |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108804868A CN108804868A (en) | 2018-11-13 |
CN108804868B true CN108804868B (en) | 2021-05-18 |
Family
ID=64094711
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810297146.5A Active CN108804868B (en) | 2018-03-30 | 2018-03-30 | Protein two-stage conformation space optimization method based on dihedral angle entropy value |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108804868B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110706741B (en) * | 2019-08-27 | 2021-08-03 | 浙江工业大学 | Multi-modal protein structure prediction method based on sequence niche |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106778059A (en) * | 2016-12-19 | 2017-05-31 | 浙江工业大学 | A kind of colony's Advances in protein structure prediction based on Rosetta local enhancements |
CN108647486A (en) * | 2018-03-22 | 2018-10-12 | 浙江工业大学 | A kind of prediction method for three-dimensional structure of protein based on conformational diversity strategy |
-
2018
- 2018-03-30 CN CN201810297146.5A patent/CN108804868B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106778059A (en) * | 2016-12-19 | 2017-05-31 | 浙江工业大学 | A kind of colony's Advances in protein structure prediction based on Rosetta local enhancements |
CN108647486A (en) * | 2018-03-22 | 2018-10-12 | 浙江工业大学 | A kind of prediction method for three-dimensional structure of protein based on conformational diversity strategy |
Non-Patent Citations (3)
Title |
---|
《Dihedral Angle-Based Sampling of Natural Product Polyketide: Conformations Application to Permeability Prediction》;Qin Wang等;《Journal of Chemical Information and Modeling》;20161012;全文 * |
《Protein Structure Idealization How accurately is it possible to model protein structures with dihedral angles?》;Xuefeng Cui等;《Algorithms for Molecular Biology》;20131231;全文 * |
《The Maximum Entropy Fallacy Redux》;Erik Aurell;《PLOS Computational Biology》;20160512;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN108804868A (en) | 2018-11-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Deng et al. | Protein structure prediction | |
CN107609342B (en) | Protein conformation search method based on secondary structure space distance constraint | |
CN107633159B (en) | Protein conformation space search method based on distance similarity | |
CN108846256B (en) | Group protein structure prediction method based on residue contact information | |
CN108647486B (en) | Protein three-dimensional structure prediction method based on conformation diversity strategy | |
CN109033744B (en) | Protein structure prediction method based on residue distance and contact information | |
CN109086566B (en) | Group protein structure prediction method based on fragment resampling | |
CN109360599B (en) | Protein structure prediction method based on residue contact information cross strategy | |
CN109360596B (en) | Protein conformation space optimization method based on differential evolution local disturbance | |
Simoncini et al. | Efficient sampling in fragment-based protein structure prediction using an estimation of distribution algorithm | |
CN108804868B (en) | Protein two-stage conformation space optimization method based on dihedral angle entropy value | |
CN109215733B (en) | Protein structure prediction method based on residue contact information auxiliary evaluation | |
CN109872770B (en) | Variable strategy protein structure prediction method combined with displacement degree evaluation | |
Zhao et al. | Protein secondary structure prediction using NMR chemical shift data | |
CN109086565B (en) | Protein structure prediction method based on contact constraint between residues | |
CN109346128B (en) | Protein structure prediction method based on residue information dynamic selection strategy | |
Hao et al. | A novel method using abstract convex underestimation in ab-initio protein structure prediction for guiding search in conformational feature space | |
Zhang et al. | Two-stage distance feature-based optimization algorithm for de novo protein structure prediction | |
Liu et al. | A self-organizing algorithm for modeling protein loops | |
CN109300506B (en) | Protein structure prediction method based on specific distance constraint | |
CN109390035B (en) | Protein conformation space optimization method based on local structure comparison | |
CN109411013B (en) | Group protein structure prediction method based on individual specific variation strategy | |
CN109243525B (en) | Method for predicting staged protein structure based on population entropy | |
CN109448785B (en) | Protein structure prediction method for enhancing Loop region structure by using Laplace graph | |
CN109461471B (en) | Adaptive protein structure prediction method based on championship mechanism |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |