CN108804868B - Protein two-stage conformation space optimization method based on dihedral angle entropy value - Google Patents

Protein two-stage conformation space optimization method based on dihedral angle entropy value Download PDF

Info

Publication number
CN108804868B
CN108804868B CN201810297146.5A CN201810297146A CN108804868B CN 108804868 B CN108804868 B CN 108804868B CN 201810297146 A CN201810297146 A CN 201810297146A CN 108804868 B CN108804868 B CN 108804868B
Authority
CN
China
Prior art keywords
dihedral angle
residue
population
dihedral
stage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810297146.5A
Other languages
Chinese (zh)
Other versions
CN108804868A (en
Inventor
张贵军
刘俊
谢腾宇
周晓根
彭春祥
郝小虎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN201810297146.5A priority Critical patent/CN108804868B/en
Publication of CN108804868A publication Critical patent/CN108804868A/en
Application granted granted Critical
Publication of CN108804868B publication Critical patent/CN108804868B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A two-stage protein conformation space optimization method based on dihedral angle entropy values comprises the steps of firstly, initializing a population by utilizing the first stage and the second stage of a Rosetta protocol; in the exploration phase, the third phase of the iterative Rosetta protocol carries out large-range conformation search, and the change of dihedral angle entropy values in each generation of population is used as the standard of phase switching; in the enhancement stage, random local disturbance is carried out on the dihedral angle of the residues in the loop region on the basis of the fourth stage of the Rosetta protocol, dihedral angle energy is introduced to guide conformation updating in combination with the Rosetta score3 energy function, and adverse effects caused by inaccuracy of the energy function are reduced while exploration on the loop region is enhanced. The invention provides a protein two-stage conformation space optimization method based on dihedral angle entropy with higher prediction precision.

Description

Protein two-stage conformation space optimization method based on dihedral angle entropy value
Technical Field
The invention relates to the fields of bioinformatics and computer application, in particular to a protein two-stage conformation space optimization method based on dihedral angle entropy.
Background
Protein molecules are important components for composing all cells and tissues of human body. All important components of the body require the involvement of proteins. The protein has abundant functions and plays an important role in the normal operation of the organism. The three-dimensional structure of a protein determines the function of the protein, and the protein can only be correctly folded into a specific three-dimensional structure to generate a specific biological function. Diseases due to protein misfolding are not uncommon. Therefore, it is necessary to obtain a three-dimensional structure of a protein in order to understand the function of the protein and cure various diseases related to the protein.
Since the end of the twentieth century, the field of life science has developed rapidly, and proteins, which are macromolecules with the widest distribution and the most complex functions in organisms, are particularly and widely concerned and researched. Prediction of the three-dimensional structure of proteins is an important task of bioinformatics. Different proteins have different amino acid sequences, and all proteins are folded on the basis of one-dimensional sequences to form specific three-dimensional structures, and the understanding of the three-dimensional structures of the proteins is the basis for researching the biological functions of the proteins. At present, the three-dimensional structure of the protein is mainly obtained by X-ray crystal diffraction and nuclear magnetic resonance imaging technologies, but the two protein structures obtained by experimental methods have different costs and respective application limitations. Therefore, it is important to predict the three-dimensional structure of a protein by simulating the process of folding the protein from an amino acid sequence into a specific spatial structure by a computer technology in combination with bioinformatics. Predicting the tertiary structure of a protein based on sequence information remains a significant challenge. The technology based on the fragment assembly principle is the most advanced method for solving the problem, but the method has insufficient prediction precision for larger and harder proteins.
Rosetta is a relatively successful protein structure prediction method based on fragment assembly at present, but the method also has limitations. The main technical difficulties at present come from two aspects. On one hand, the conformation sampling capability of the existing protein structure prediction method is not strong enough, and the exploration efficiency of a loop area is extremely low; another aspect is that the energy function is not accurate enough and the low energy conformation does not necessarily correspond to the near-native conformation, which makes it impossible to produce a prediction result with high accuracy using only the energy function as a criterion for the conformational update.
Therefore, the current protein structure prediction methods have disadvantages and need to be improved.
Disclosure of Invention
In order to overcome the defects of the conventional protein structure prediction method, the invention provides a two-stage conformation space optimization method with higher prediction precision based on dihedral angle entropy. Firstly, initializing a population by utilizing a first phase and a second phase of a Rosetta protocol; in the exploration phase, the third phase of the iterative Rosetta protocol carries out large-range conformation search, and the change of dihedral angle entropy values in each generation of population is used as the standard of phase switching; in the enhancement stage, random local perturbation is carried out on the dihedral angle of the residues in the loop region on the basis of the fourth stage of the Rosetta protocol, and dihedral angle energy is introduced to guide conformation update in combination with the Rosetta score3 energy function.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a method for two-stage conformational space optimization of a protein based on dihedral entropy, the method comprising the steps of:
1) inputting sequence information and secondary structure information of a predicted protein;
2) setting parameters: population size NP, maximum number of iterations in exploration phase
Figure BDA0001618908550000021
Maximum number of iterations in enhancement phase
Figure BDA0001618908550000022
An entropy convergence threshold α;
3) population initialization: the first and second phases of the Rosetta protocol are iterated to generate an initial population with NP individuals
Figure BDA0001618908550000023
And let P be { P ═ P1,P2,...,PNP}=Pinit
4) Calculating the entropy value of the dihedral angle of the population by the following process:
4.1) dividing the dihedral angle range [ -180 °,180 °) into 36 dihedral angle blocks with 10 ° as a reference;
4.2) statistics of the probability that dihedral angles φ and ψ fall on each block for each residue of all conformations in the population
Figure BDA0001618908550000024
And
Figure BDA0001618908550000025
wherein i denotes the residue number, i ∈ {1, 2., l }, l denotes the protein sequence length, j denotes the dihedral corner block, j ∈ {1, 2., 36 }; obtaining the distribution probability of dihedral angles of each residue;
4.3) according to the formula
Figure BDA0001618908550000026
And
Figure BDA0001618908550000027
calculating the entropy value of dihedral angles phi and psi of each residue;
4.4) according to the formula
Figure BDA0001618908550000028
Solving a population dihedral angle entropy value;
5) the exploration phase comprises the following processes:
5.1) start iteration, set g11, where the number of iterations
Figure BDA00016189085500000210
5.2) performing Rosetta protocol third-phase assembly on each individual in the population P;
5.3) executing the step 4), and calculating the entropy value of the dihedral angle of the population
Figure BDA0001618908550000029
5.4) if g1And if the entropy value is more than 20, judging whether the entropy value is converged, wherein the process is as follows:
5.4.1) according to the formula
Figure BDA0001618908550000033
Wherein k belongs to {1, 2.,. 20}, and each entropy change of the current generation and the previous 20 generations is calculated;
5.4.2) according to the formula εmax=max{ε12,...,ε20Calculating the maximum entropy change;
5.4.3) if εmax<α,Ending the exploration phase, and entering the enhancement phase in the step 6); otherwise, executing step 5.5);
5.5)g1=g1+1;
5.6) if
Figure BDA0001618908550000031
Turning to step 5.2) to execute the next iteration; otherwise, ending the exploration phase and entering the enhancement phase in the step 6);
6) the enhancement stage comprises the following processes:
6.1) start iteration, let g2 be 1, where the number of iterations is
Figure BDA0001618908550000032
6.2) performing Rosetta protocol fourth phase assembly on each individual in the population P;
6.3) locally perturbing each individual of the population P, as follows:
6.3.1) order P*=PnIn which P isnRepresents individuals in the population P, and n belongs to {1, 2.., NP };
6.3.2) randomly selecting a residue position r of a loop region;
6.3.3) generating a random decimal rand1, rand1 belongs to [0,1 ];
6.3.4) if rand1 is less than 0.5, execute step 6.3.5); otherwise, go to step 6.3.6)
6.3.5) Pair of conformations P*The dihedral angle phi of the r-th residue in (b) is locally perturbed as follows:
6.3.5.1) reading conformation P*The value phi of the dihedral angle phi of the r-th residue in (1);
6.3.5.2) generates a random number rand2, rand2 ∈ [ -5 °,5 ° ];
6.3.5.3)phi=phi+rand2;
6.3.5.4) will form a conformation P*Replacing the value of dihedral angle phi for the residue at residue r in (1) with phi;
6.3.6) Pair of conformations P*The dihedral angle ψ of the r-th residue in (a) is locally perturbed as follows:
6.3.6.1) readTaking conformation P*The dihedral angle psi value of the r-th residue psi;
6.3.6.2) generates a random number rand3, rand3 ∈ [ -5 °,5 ° ];
6.3.6.3)psi=psi+rand3;
6.3.6.4) will form a conformation P*The value of dihedral angle ψ of the r-th residue in (a) is replaced with psi;
6.3.7) if P*Dihedral angle of medium perturbation and PnThe corresponding dihedral angles in the block are in the same dihedral angle block, and step 6.3.8) is executed; otherwise, go to step 6.3.9)
6.3.8) computing P with the Rosetta score3 energy function*And PnThe energy of (a); deciding whether to use P according to Metropolis criterion*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.3.9) directed conformational update according to dihedral energy, the procedure is as follows:
6.3.9.1) determining P based on the dihedral distribution probability of the last generation residue in the search stage*Dihedral angle and P of medium perturbationnDistribution probability p of dihedral angle block in which corresponding dihedral angle is located*And pn
6.3.9.2) according to formula e*=-ln(p*) And en=-ln(pn) Calculating dihedral angle energy;
6.3.9.3) determine whether to use P according to Metropolis criteria*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.4)g2=g2+1;
6.5) if
Figure BDA0001618908550000041
Skipping to step 6.2) to execute the next iteration; otherwise, ending the enhancement stage; 7) and outputting a prediction result according to a Rosetta protocol.
The invention has the beneficial effects that: guiding conformation search by counting entropy value information of two-sided angles of conformation population residues in the prediction process, and realizing exploration and enhancement switching according to the change of entropy values; randomly selecting a residue dihedral angle of a loop region in the conformation to carry out local disturbance, selecting a Rosetta score3 energy function or dihedral angle energy to guide conformation update according to the distribution of the dihedral angle, and reducing adverse effects brought by inaccurate energy function while enhancing the search for the loop region; thereby improving the accuracy of protein structure prediction.
Drawings
FIG. 1 is a schematic diagram of conformational update in the structural prediction of protein 1ctf by a protein two-stage conformational space optimization method based on dihedral entropy.
FIG. 2 is a three-dimensional structure diagram of protein 1ctf obtained by structure prediction based on dihedral entropy two-stage conformational space optimization method.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
Referring to fig. 1 and2, a two-stage conformation space optimization method of protein based on dihedral angle entropy includes the following steps:
1) inputting sequence information and secondary structure information of a predicted protein;
2) setting parameters: population size NP, maximum number of iterations in exploration phase
Figure BDA0001618908550000042
Maximum number of iterations in enhancement phase
Figure BDA0001618908550000051
An entropy convergence threshold α;
3) population initialization: the first and second phases of the Rosetta protocol are iterated to generate an initial population with NP individuals
Figure BDA0001618908550000052
And let P be { P ═ P1,P2,...,PNP}=Pinit
4) Calculating the entropy value of the dihedral angle of the population by the following process:
4.1) dividing the dihedral angle range [ -180 °,180 °) into 36 dihedral angle blocks with 10 ° as a reference;
4.2) statistics of the probability that dihedral angles φ and ψ fall on each block for each residue of all conformations in the population
Figure BDA0001618908550000053
And
Figure BDA0001618908550000054
wherein i denotes the residue number, i ∈ {1, 2., l }, l denotes the protein sequence length, j denotes the dihedral corner block, j ∈ {1, 2., 36 }; obtaining the distribution probability of dihedral angles of each residue;
4.3) according to the formula
Figure BDA0001618908550000055
And
Figure BDA0001618908550000056
calculating the entropy value of dihedral angles phi and psi of each residue;
4.4) according to the formula
Figure BDA0001618908550000057
Solving a population dihedral angle entropy value;
5) the exploration phase comprises the following processes:
5.1) start iteration, set g11, where the number of iterations
Figure BDA00016189085500000511
5.2) performing Rosetta protocol third-phase assembly on each individual in the population P;
5.3) executing the step 4), and calculating the entropy value of the dihedral angle of the population
Figure BDA00016189085500000512
5.4) if g1And if the entropy value is more than 20, judging whether the entropy value is converged, wherein the process is as follows:
5.4.1) according to the formula
Figure BDA00016189085500000510
Wherein k belongs to {1, 2.,. 20}, and each entropy change of the current generation and the previous 20 generations is calculated;
5.4.2) according to the formula εmax=max{ε12,...,ε20Calculating the maximum entropy change;
5.4.3) if εmaxIf the alpha is less than alpha, ending the exploration stage and entering the step 6) enhancement stage; otherwise, executing step 5.5);
5.5)g1=g1+1;
5.6) if
Figure BDA0001618908550000058
Turning to step 5.2) to execute the next iteration; otherwise, ending the exploration phase and entering the enhancement phase in the step 6);
6) the enhancement stage comprises the following processes:
6.1) begin iteration, set g21, where the number of iterations
Figure BDA0001618908550000059
6.2) performing Rosetta protocol fourth phase assembly on each individual in the population P;
6.3) locally perturbing each individual of the population P, as follows:
6.3.1) order P*=PnIn which P isnRepresents individuals in the population P, and n belongs to {1, 2.., NP };
6.3.2) randomly selecting a residue position r of a loop region;
6.3.3) generating a random decimal rand1, rand1 belongs to [0,1 ];
6.3.4) if rand1 is less than 0.5, execute step 6.3.5); otherwise, go to step 6.3.6)
6.3.5) Pair of conformations P*The dihedral angle phi of the r-th residue in (b) is locally perturbed as follows:
6.3.5.1) reading conformation P*The value phi of the dihedral angle phi of the r-th residue in (1);
6.3.5.2) generates a random number rand2, rand2 ∈ [ -5 °,5 ° ];
6.3.5.3)phi=phi+rand2;
6.3.5.4) will form a conformation P*Replacing the value of dihedral angle phi for the residue at residue r in (1) with phi;
6.3.6) Pair of conformations P*The dihedral angle ψ of the r-th residue in (a) is locally perturbed as follows:
6.3.6.1) reading conformation P*The dihedral angle psi value of the r-th residue psi;
6.3.6.2) generates a random number rand3, rand3 ∈ [ -5 °,5 ° ];
6.3.6.3)psi=psi+rand3;
6.3.6.4) will form a conformation P*The value of dihedral angle ψ of the r-th residue in (a) is replaced with psi;
6.3.7) if P*Dihedral angle of medium perturbation and PnThe corresponding dihedral angles in the block are in the same dihedral angle block, and step 6.3.8) is executed; otherwise, go to step 6.3.9)
6.3.8) computing P with the Rosetta score3 energy function*And PnThe energy of (a); deciding whether to use P according to Metropolis criterion*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.3.9) directed conformational update according to dihedral energy, the procedure is as follows:
6.3.9.1) determining P based on the dihedral distribution probability of the last generation residue in the search stage*Dihedral angle and P of medium perturbationnDistribution probability p of dihedral angle block in which corresponding dihedral angle is located*And pn
6.3.9.2) according to formula e*=-ln(p*) And en=-ln(pn) Calculating dihedral angle energy;
6.3.9.3) determine whether to use P according to Metropolis criteria*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.4)g2=g2+1;
6.5) if
Figure BDA0001618908550000071
Jump toTo step 6.2) executing next iteration; otherwise, ending the enhancement stage; 7) and outputting a prediction result according to a Rosetta protocol.
In this embodiment, taking protein 1ctf with a sequence length of 68 as an example, a two-stage conformational space optimization method of protein based on dihedral entropy comprises the following steps:
1) inputting sequence information and secondary structure information of protein 1 ctf;
2) setting parameters: population size NP is 100, maximum number of iterations in exploration phase
Figure BDA00016189085500000712
Maximum number of iterations in enhancement phase
Figure BDA0001618908550000072
The entropy convergence threshold α is 0.005;
3) population initialization: the first and second phases of the Rosetta protocol are iterated to generate an initial population with NP individuals
Figure BDA0001618908550000073
And let P be { P ═ P1,P2,...,PNP}=Pinit
4) Calculating the entropy value of the dihedral angle of the population by the following process:
4.1) dividing the dihedral angle range [ -180 °,180 °) into 36 dihedral angle blocks with 10 ° as a reference;
4.2) statistics of the probability that dihedral angles φ and ψ fall on each block for each residue of all conformations in the population
Figure BDA0001618908550000074
And
Figure BDA0001618908550000075
wherein i denotes the residue number, i ∈ {1, 2., l }, l denotes the protein sequence length, j denotes the dihedral corner block, j ∈ {1, 2., 36 }; obtaining the distribution probability of dihedral angles of each residue;
4.3) according to the formula
Figure BDA0001618908550000076
And
Figure BDA0001618908550000077
calculating the entropy value of dihedral angles phi and psi of each residue;
4.4) according to the formula
Figure BDA0001618908550000078
Solving a population dihedral angle entropy value;
5) the exploration phase comprises the following processes:
5.1) start iteration, set g11, where the number of iterations
Figure BDA0001618908550000079
5.2) performing Rosetta protocol third-phase assembly on each individual in the population P;
5.3) executing the step 4), and calculating the entropy value of the dihedral angle of the population
Figure BDA00016189085500000710
5.4) if g1And if the entropy value is more than 20, judging whether the entropy value is converged, wherein the process is as follows:
5.4.1) according to the formula
Figure BDA00016189085500000711
Wherein k belongs to {1, 2.,. 20}, and each entropy change of the current generation and the previous 20 generations is calculated;
5.4.2) according to the formula εmax=max{ε12,...,ε20Calculating the maximum entropy change;
5.4.3) if εmaxIf the alpha is less than alpha, ending the exploration stage and entering the step 6) enhancement stage; otherwise, executing step 5.5);
5.5)g1=g1+1;
5.6) if
Figure BDA0001618908550000081
Go to the step5.2) executing the next iteration; otherwise, ending the exploration phase and entering the enhancement phase in the step 6);
6) the enhancement stage comprises the following processes:
6.1) begin iteration, set g21, where the number of iterations
Figure BDA0001618908550000082
6.2) performing Rosetta protocol fourth phase assembly on each individual in the population P;
6.3) locally perturbing each individual of the population P, as follows:
6.3.1) order P*=PnIn which P isnRepresents individuals in the population P, and n belongs to {1, 2.., NP };
6.3.2) randomly selecting a residue position r of a loop region;
6.3.3) generating a random decimal rand1, rand1 belongs to [0,1 ];
6.3.4) if rand1 is less than 0.5, execute step 6.3.5); otherwise, go to step 6.3.6)
6.3.5) Pair of conformations P*The dihedral angle phi of the r-th residue in (b) is locally perturbed as follows:
6.3.5.1) reading conformation P*The value phi of the dihedral angle phi of the r-th residue in (1);
6.3.5.2) generates a random number rand2, rand2 ∈ [ -5 °,5 ° ];
6.3.5.3)phi=phi+rand2;
6.3.5.4) will form a conformation P*Replacing the value of dihedral angle phi for the residue at residue r in (1) with phi;
6.3.6) Pair of conformations P*The dihedral angle ψ of the r-th residue in (a) is locally perturbed as follows:
6.3.6.1) reading conformation P*The dihedral angle psi value of the r-th residue psi;
6.3.6.2) generates a random number rand3, rand3 ∈ [ -5 °,5 ° ];
6.3.6.3)psi=psi+rand3;
6.3.6.4) will form a conformation P*Substitution of the value of dihedral angle ψ for the r-th residue in (1)Is psi;
6.3.7) if P*Dihedral angle of medium perturbation and PnThe corresponding dihedral angles in the block are in the same dihedral angle block, and step 6.3.8) is executed; otherwise, go to step 6.3.9)
6.3.8) computing P with the Rosetta score3 energy function*And PnThe energy of (a); deciding whether to use P according to Metropolis criterion*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.3.9) directed conformational update according to dihedral energy, the procedure is as follows:
6.3.9.1) determining P based on the dihedral distribution probability of the last generation residue in the search stage*Dihedral angle and P of medium perturbationnDistribution probability p of dihedral angle block in which corresponding dihedral angle is located*And pn
6.3.9.2) according to formula e*=-ln(p*) And en=-ln(pn) Calculating dihedral angle energy;
6.3.9.3) determine whether to use P according to Metropolis criteria*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.4)g2=g2+1;
6.5) if
Figure BDA0001618908550000091
Skipping to step 6.2) to execute the next iteration; otherwise, ending the enhancement stage; 7) and outputting a prediction result according to a Rosetta protocol.
Taking the protein 1ctf with the amino acid sequence length of 68 as an example, the near-native conformation of the protein is obtained by the method, and the conformation renewal scheme is shown in FIG. 1; minimum root mean square deviation of
Figure BDA0001618908550000092
The prediction structure is shown in fig. 2.
The foregoing is a predictive description of the invention as embodied in one embodiment, and it will be apparent that the invention is not limited to the embodiment described above, but may be embodied with various modifications without departing from the basic inventive concept and without departing from the spirit thereof.

Claims (1)

1. A protein two-stage conformation space optimization method based on dihedral angle entropy is characterized in that: the two-stage conformation space optimization method of the protein comprises the following steps:
1) inputting sequence information and secondary structure information of a predicted protein;
2) setting parameters: population size NP, maximum number of iterations in exploration phase
Figure FDA0001618908540000011
Maximum number of iterations in enhancement phase
Figure FDA0001618908540000012
An entropy convergence threshold α;
3) population initialization: the first and second phases of the Rosetta protocol are iterated to generate an initial population with NP individuals
Figure FDA0001618908540000013
And let P be { P ═ P1,P2,...,PNP}=Pinit
4) Calculating the entropy value of the dihedral angle of the population by the following process:
4.1) dividing the dihedral angle range [ -180 °,180 °) into 36 dihedral angle blocks with 10 ° as a reference;
4.2) statistics of the probability that dihedral angles φ and ψ fall on each block for each residue of all conformations in the population
Figure FDA0001618908540000014
And
Figure FDA0001618908540000015
wherein i denotes the residue number, i ∈ {1, 2., l }, l denotes the protein sequence length, j denotes the dihedral corner block, j ∈ {1, 2., 36 }; obtaining the distribution probability of dihedral angle of each residue;
4.3) according to the formula
Figure FDA0001618908540000016
And
Figure FDA0001618908540000017
calculating the entropy value of dihedral angles phi and psi of each residue;
4.4) according to the formula
Figure FDA0001618908540000018
Solving a population dihedral angle entropy value;
5) the exploration phase comprises the following processes:
5.1) start iteration, set g11, where the number of iterations
Figure FDA0001618908540000019
5.2) performing Rosetta protocol third-phase assembly on each individual in the population P;
5.3) executing the step 4), and calculating the entropy value of the dihedral angle of the population
Figure FDA00016189085400000110
5.4) if g1And if the entropy value is more than 20, judging whether the entropy value is converged, wherein the process is as follows:
5.4.1) according to the formula
Figure FDA00016189085400000111
Wherein k belongs to {1, 2.,. 20}, and each entropy change of the current generation and the previous 20 generations is calculated;
5.4.2) according to the formula εmax=max{ε12,...,ε20Calculating the maximum entropy change;
5.4.3) if εmaxIf the alpha is less than alpha, ending the exploration stage and entering the step 6) enhancement stage; otherwise, executing step 5.5);
5.5)g1=g1+1;
5.6) if
Figure FDA0001618908540000021
Turning to step 5.2) to execute the next iteration; otherwise, ending the exploration phase and entering the enhancement phase in the step 6);
6) the enhancement stage comprises the following processes:
6.1) begin iteration, set g21, where the number of iterations
Figure FDA0001618908540000022
6.2) performing Rosetta protocol fourth phase assembly on each individual in the population P;
6.3) locally perturbing each individual of the population P, as follows:
6.3.1) order P*=PnIn which P isnRepresents individuals in the population P, and n belongs to {1, 2.., NP };
6.3.2) randomly selecting a residue position r of a loop region;
6.3.3) generating a random decimal rand1, rand1 belongs to [0,1 ];
6.3.4) if rand1 is less than 0.5, execute step 6.3.5); otherwise, go to step 6.3.6)
6.3.5) Pair of conformations P*The dihedral angle phi of the r-th residue in (b) is locally perturbed as follows:
6.3.5.1) reading conformation P*The value phi of the dihedral angle phi of the r-th residue in (1);
6.3.5.2) generates a random number rand2, rand2 ∈ [ -5 °,5 ° ];
6.3.5.3)phi=phi+rand2;
6.3.5.4) will form a conformation P*Replacing the value of dihedral angle phi for the residue at residue r in (1) with phi;
6.3.6) Pair of conformations P*The dihedral angle ψ of the r-th residue in (a) is locally perturbed as follows:
6.3.6.1) reading conformation P*The dihedral angle psi value of the r-th residue psi;
6.3.6.2) generates a random number rand3, rand3 ∈ [ -5 °,5 ° ];
6.3.6.3)psi=psi+rand3;
6.3.6.4) will form a conformation P*The value of dihedral angle ψ of the r-th residue in (a) is replaced with psi;
6.3.7) if P*Dihedral angle of medium perturbation and PnThe corresponding dihedral angles in the block are in the same dihedral angle block, and step 6.3.8) is executed; otherwise, go to step 6.3.9)
6.3.8) computing P with the Rosetta score3 energy function*And PnThe energy of (a); deciding whether to use P according to Metropolis criterion*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.3.9) directed conformational update according to dihedral energy, the procedure is as follows:
6.3.9.1) determining P based on the dihedral distribution probability of the last generation residue in the search stage*Dihedral angle and P of medium perturbationnDistribution probability p of dihedral angle block in which corresponding dihedral angle is located*And pn
6.3.9.2) according to formula e*=-ln(p*) And en=-ln(pn) Calculating dihedral angle energy;
6.3.9.3) determine whether to use P according to Metropolis criteria*Replacement of PnI.e. to determine whether to accept the pair of conformations PnLocal perturbation of (2);
6.4)g2=g2+1;
6.5) if
Figure FDA0001618908540000031
Skipping to step 6.2) to execute the next iteration; otherwise, the enhancement phase is ended
7) And outputting a prediction result according to a Rosetta protocol.
CN201810297146.5A 2018-03-30 2018-03-30 Protein two-stage conformation space optimization method based on dihedral angle entropy value Active CN108804868B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810297146.5A CN108804868B (en) 2018-03-30 2018-03-30 Protein two-stage conformation space optimization method based on dihedral angle entropy value

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810297146.5A CN108804868B (en) 2018-03-30 2018-03-30 Protein two-stage conformation space optimization method based on dihedral angle entropy value

Publications (2)

Publication Number Publication Date
CN108804868A CN108804868A (en) 2018-11-13
CN108804868B true CN108804868B (en) 2021-05-18

Family

ID=64094711

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810297146.5A Active CN108804868B (en) 2018-03-30 2018-03-30 Protein two-stage conformation space optimization method based on dihedral angle entropy value

Country Status (1)

Country Link
CN (1) CN108804868B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110706741B (en) * 2019-08-27 2021-08-03 浙江工业大学 Multi-modal protein structure prediction method based on sequence niche

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106778059A (en) * 2016-12-19 2017-05-31 浙江工业大学 A kind of colony's Advances in protein structure prediction based on Rosetta local enhancements
CN108647486A (en) * 2018-03-22 2018-10-12 浙江工业大学 A kind of prediction method for three-dimensional structure of protein based on conformational diversity strategy

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106778059A (en) * 2016-12-19 2017-05-31 浙江工业大学 A kind of colony's Advances in protein structure prediction based on Rosetta local enhancements
CN108647486A (en) * 2018-03-22 2018-10-12 浙江工业大学 A kind of prediction method for three-dimensional structure of protein based on conformational diversity strategy

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
《Dihedral Angle-Based Sampling of Natural Product Polyketide: Conformations Application to Permeability Prediction》;Qin Wang等;《Journal of Chemical Information and Modeling》;20161012;全文 *
《Protein Structure Idealization How accurately is it possible to model protein structures with dihedral angles?》;Xuefeng Cui等;《Algorithms for Molecular Biology》;20131231;全文 *
《The Maximum Entropy Fallacy Redux》;Erik Aurell;《PLOS Computational Biology》;20160512;全文 *

Also Published As

Publication number Publication date
CN108804868A (en) 2018-11-13

Similar Documents

Publication Publication Date Title
Deng et al. Protein structure prediction
CN107609342B (en) Protein conformation search method based on secondary structure space distance constraint
CN107633159B (en) Protein conformation space search method based on distance similarity
CN108846256B (en) Group protein structure prediction method based on residue contact information
CN108647486B (en) Protein three-dimensional structure prediction method based on conformation diversity strategy
CN109033744B (en) Protein structure prediction method based on residue distance and contact information
CN109086566B (en) Group protein structure prediction method based on fragment resampling
CN109360599B (en) Protein structure prediction method based on residue contact information cross strategy
CN109360596B (en) Protein conformation space optimization method based on differential evolution local disturbance
Simoncini et al. Efficient sampling in fragment-based protein structure prediction using an estimation of distribution algorithm
CN108804868B (en) Protein two-stage conformation space optimization method based on dihedral angle entropy value
CN109215733B (en) Protein structure prediction method based on residue contact information auxiliary evaluation
CN109872770B (en) Variable strategy protein structure prediction method combined with displacement degree evaluation
Zhao et al. Protein secondary structure prediction using NMR chemical shift data
CN109086565B (en) Protein structure prediction method based on contact constraint between residues
CN109346128B (en) Protein structure prediction method based on residue information dynamic selection strategy
Hao et al. A novel method using abstract convex underestimation in ab-initio protein structure prediction for guiding search in conformational feature space
Zhang et al. Two-stage distance feature-based optimization algorithm for de novo protein structure prediction
Liu et al. A self-organizing algorithm for modeling protein loops
CN109300506B (en) Protein structure prediction method based on specific distance constraint
CN109390035B (en) Protein conformation space optimization method based on local structure comparison
CN109411013B (en) Group protein structure prediction method based on individual specific variation strategy
CN109243525B (en) Method for predicting staged protein structure based on population entropy
CN109448785B (en) Protein structure prediction method for enhancing Loop region structure by using Laplace graph
CN109461471B (en) Adaptive protein structure prediction method based on championship mechanism

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant