CN112085244A - Residue contact map-based multi-objective optimization protein structure prediction method - Google Patents

Residue contact map-based multi-objective optimization protein structure prediction method Download PDF

Info

Publication number
CN112085244A
CN112085244A CN202010704125.8A CN202010704125A CN112085244A CN 112085244 A CN112085244 A CN 112085244A CN 202010704125 A CN202010704125 A CN 202010704125A CN 112085244 A CN112085244 A CN 112085244A
Authority
CN
China
Prior art keywords
contact
population
residue
conformation
server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010704125.8A
Other languages
Chinese (zh)
Inventor
张贵军
陈芳
彭春祥
李亭
刘俊
周晓根
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN202010704125.8A priority Critical patent/CN112085244A/en
Publication of CN112085244A publication Critical patent/CN112085244A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Business, Economics & Management (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Biology (AREA)
  • Biotechnology (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Quality & Reliability (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Development Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Operations Research (AREA)
  • Tourism & Hospitality (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)

Abstract

A multi-objective optimization protein structure prediction method based on a residue contact map comprises the following steps: firstly, predicting a residue Contact map of a target protein sequence by utilizing tripletRNAs, MetaPSICOV, RaptorX and STOP-Contact; secondly, designing a scoring function to initialize the population; then, sampling the population through fragment recombination and assembly; finally, aiming at different prediction servers, four contact energies are obtained through calculation according to the designed contact energy function, and a weighted scoring function E of a harmonic mean and a standard deviation is adoptedtotal(Cn) All conformations in the population are ranked and the first conformation in the last generation is selected as the predicted result. The invention can relieveThe single residue contact map prediction is inaccurate, so that the protein is misfolded, the diversity is increased, and the overall prediction accuracy is improved.

Description

Residue contact map-based multi-objective optimization protein structure prediction method
Technical Field
The invention relates to the fields of bioinformatics and computer application, in particular to a residue contact map-based multi-objective optimization protein structure prediction method.
Background
Proteins are the cornerstone of life, almost all cellular activities are involved in proteins, and the three-dimensional structure of proteins determines their specific biological functions. Therefore, structural information of proteins is crucial in protein research. For example, the catalytic function of an enzyme is performed by a portion of the protein chain, i.e., the active sites exposed on the surface of the protein. Protein interactions and protein interactions with nucleic acids, inhibitors, activators are also limited to specific protein surface areas. Therefore, it is only possible to design targeted drugs that interact with the protein surface if the protein structure is known. At present, protein structure is mainly determined by means of biological wet experiments. However, the biological assay method has the disadvantages of complicated operation, long time consumption, high cost and the like. Therefore, based on computer technology, predicting the tertiary structure of a protein directly from the primary or secondary structure of the protein is a major problem in structural bioinformatics research.
Protein structure prediction mainly has the following two problems: (1) the problem of the ability to sample under high-dimensional and complex energy functions. (2) And the energy function model is inaccurate. At present, we are still far from constructing a sufficiently accurate force field that can direct the folding of the target sequence in the correct direction, resulting in a mathematically optimal solution that does not necessarily correspond to the native structure of the target protein. Aiming at the problem of inaccuracy of an energy function, the method helps to improve the prediction accuracy of the protein structure by using residue contact information to assist the energy function, but the currently predicted residue contact information is inaccurate and easily influences the overall prediction accuracy.
Therefore, improvement is needed to solve the problem that the existing inaccurate residue contact information results in insufficient accuracy of protein structure prediction.
Disclosure of Invention
In order to solve the problem that the prediction accuracy of a protein structure is insufficient due to inaccurate contact information of a single residue, the invention provides a multi-objective optimization protein structure prediction method based on a residue contact map.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a method for residue contact map-based multi-objective optimized protein structure prediction, the method comprising the steps of:
1) inputting sequence information of a target protein, and predicting a residue Contact map of the target sequence by using a triplets server (zhanglab, ccmb, med, umich, edu/resttriplet), a metapsiccov server (bioif, cs, ucl, ac, uk/psipled), a RaptorX server (RaptorX, uchicago, edu/ContactMap) and a STOP-Contact server (spark-lab, org/server/shot-Contact);
2) acquiring fragment library files of 3 fragments and 9 fragments from a ROBETTA server (http:// www.robetta.org /) according to a target protein sequence;
3) according to the confidence coefficient between residue pairs in the residue contact map, arranging the confidence coefficient of the residue pairs from large to small, and selecting the first 2L pieces of contact information, wherein L is the sequence length of the target protein;
4) setting parameters: population size NP, maximum number of iterations G, temperature factor beta, omegaT,ωP,ωRAnd ωSRespectively representing the weights corresponding to Contact energies calculated from residue Contact maps predicted by the servers TripletRes, MetaPSICOV, RaptorX and STOP-Contact;
5) setting G ═ 1, G ∈ {1, 2.., G };
6) population initialization, the process is as follows:
6.1) first and second phases using the Rosetta protocolGeneration of 2NP initial conformations Cn,n={1,2,…,2NP};
6.2) calculating the energy of each individual in the population using the Rosetta score3 energy function, Individual CnHas an energy of score3 (C)n) And calculating four contact energies E according to the residue contact maps predicted by the four different contact prediction serversC(Cn) And are respectively denoted as EC1(Cn),EC2(Cn),EC3(Cn) And EC4(Cn) In which EC(Cn) The calculation formula of (a) is as follows:
Figure BDA0002594070230000021
Figure BDA0002594070230000022
wherein N is the total number of residue pairs,
Figure BDA0002594070230000031
is the confidence of the contact of the kth residue in the residue contact map to the (i, j),
Figure BDA0002594070230000032
is the distance between the kth residue pair (i, j) in the target protein, dconIs a threshold value of 8, EC1(Cn) Representing contact energy calculated from residue contact maps predicted by the TripletRes server, EC2(Cn) Representing contact energy calculated from residue contact maps predicted by MetaPSICOV servers, EC3(Cn) Representing the calculated contact energy, E, from a residue contact map predicted by a Raptorx serverC4(Cn) Represents the Contact energy calculated from the residue Contact map predicted by the STOP-Contact server;
6.3) according to Etotal(Cn) Sequencing the individuals in the population from low to high, taking the first NP individuals as an initialization population, Etotal(Cn) The calculation formula of (a) is as follows:
Figure BDA0002594070230000033
Figure BDA0002594070230000034
Figure BDA0002594070230000035
Figure BDA0002594070230000036
Figure BDA0002594070230000037
7) conformational recombination, assembly, as follows:
7.1)Cnfor the nth individual in the population and selecting two conformations C in the population which are different from the target conformationn1,Cn2Three different fragments were randomly selected in these two conformations, replacing conformation CnFragment of the corresponding position in the sequence to generate a new recombinant conformation C'n
7.2) according to each conformation C 'in the starting population'nAssembly of fragments 3 and 9 to give a new conformation C ″n
7.3) if score3 (C'n)>score3(C″n) Then, the conformation C ″, is usednSubstitute for C'nOtherwise, receiving the new conformation by using the Boltzmann probability to generate new NP individuals of the child population, forming a new population together with the NP individuals in the parent population, wherein the new population is 2NP individuals, and realizing the update of the new population;
8) performing population updating operation, repeating the steps 6.2) to 6.3), and selecting front NP individuals as new populations;
9) setting G to G +1, and if G > G, executing step 10); otherwise, go to 7);
10) the first conformation in the G-th generation is output as the final predicted result.
The technical conception of the invention is as follows: a multi-objective optimization protein structure prediction method based on a residue contact map comprises the following steps: firstly, predicting a residue Contact map of a target protein sequence by utilizing tripletRNAs, MetaPSICOV, RaptorX and STOP-Contact; secondly, designing a scoring function to initialize the population; then, sampling the population through fragment recombination and assembly; finally, aiming at different prediction servers, four contact energies are obtained through calculation according to the designed contact energy function, and a weighted scoring function E of a harmonic mean and a standard deviation is adoptedtotal(Cn) All conformations in the population are ranked and the first conformation in the last generation is selected as the predicted result. The multi-objective optimization protein structure prediction method based on the residue contact map can relieve protein misfolding caused by inaccurate prediction of a single residue contact map, thereby increasing diversity and improving overall prediction accuracy.
The invention has the beneficial effects that: according to a multi-objective optimization strategy, the defect that the conformation is evaluated only by a single energy function or a single residue contact map is overcome by using four pieces of residue contact map information, the population diversity is increased, and the overall prediction precision is improved.
Drawings
FIG. 1 is a schematic diagram of conformational update in the structural prediction of protein 1BG8 by a residue contact map-based multi-objective optimization protein structure prediction method.
FIG. 2 is a three-dimensional structure diagram of protein 1BG8 predicted by a residue contact map-based multi-objective optimization protein structure prediction method.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
Referring to fig. 1 and 2, a method for residue contact map-based multi-objective optimized protein structure prediction, the method comprising the steps of:
1) inputting sequence information of a target protein, and predicting a residue Contact map of the target sequence by using a triplets server (zhanglab, ccmb, med, umich, edu/resttriplet), a metapsiccov server (bioif, cs, ucl, ac, uk/psipled), a RaptorX server (RaptorX, uchicago, edu/ContactMap) and a STOP-Contact server (spark-lab, org/server/shot-Contact);
2) acquiring fragment library files of 3 fragments and 9 fragments from a ROBETTA server (http:// www.robetta.org /) according to a target protein sequence;
3) according to the confidence coefficient between residue pairs in the residue contact map, arranging the confidence coefficient of the residue pairs from large to small, and selecting the first 2L pieces of contact information, wherein L is the sequence length of the target protein;
4) setting parameters: population size NP, maximum number of iterations G, temperature factor beta, omegaT,ωP,ωRAnd ωSRespectively representing the weights corresponding to Contact energies calculated from residue Contact maps predicted by the servers TripletRes, MetaPSICOV, RaptorX and STOP-Contact;
5) setting G ═ 1, G ∈ {1, 2.., G };
6) population initialization, the process is as follows:
6.1) generating 2NP initial constellations C by using the first and second stages of the Rosetta protocoln,n={1,2,…,2NP};
6.2) calculating the energy of each individual in the population using the Rosetta score3 energy function, Individual CnHas an energy of score3 (C)n) And calculating four contact energies E according to the residue contact maps predicted by the four different contact prediction serversC(Cn) And are respectively denoted as EC1(Cn),EC2(Cn),EC3(Cn) And EC4(Cn) In which EC(Cn) The calculation formula of (a) is as follows:
Figure BDA0002594070230000051
Figure BDA0002594070230000052
wherein N is the total number of residue pairs,
Figure BDA0002594070230000053
is the confidence of the contact of the kth residue in the residue contact map to the (i, j),
Figure BDA0002594070230000054
is the distance between the kth residue pair (i, j) in the target protein, dconIs a threshold value of 8, EC1(Cn) Representing contact energy calculated from residue contact maps predicted by the TripletRes server, EC2(Cn) Representing contact energy calculated from residue contact maps predicted by MetaPSICOV servers, EC3(Cn) Representing the calculated contact energy, E, from a residue contact map predicted by a Raptorx serverC4(Cn) Represents the Contact energy calculated from the residue Contact map predicted by the STOP-Contact server;
6.3) according to Etotal(Cn) Sequencing the individuals in the population from low to high, taking the first NP individuals as an initialization population, Etotal(Cn) The calculation formula of (a) is as follows:
Figure BDA0002594070230000061
Figure BDA0002594070230000062
Figure BDA0002594070230000063
Figure BDA0002594070230000064
Figure BDA0002594070230000065
7) conformational recombination, assembly, as follows:
7.1)Cnfor the nth individual in the population and selecting two conformations C in the population which are different from the target conformationn1,Cn2Three different fragments were randomly selected in these two conformations, replacing conformation CnFragment of the corresponding position in the sequence to generate a new recombinant conformation C'n
7.2) according to each conformation C 'in the starting population'nAssembly of fragments 3 and 9 to give a new conformation C ″n
7.3) if score3 (C'n)>score3(C″n) Then, the conformation C ″, is usednSubstitute for C'nOtherwise, receiving the new conformation by using the Boltzmann probability to generate new NP individuals of the child population, forming a new population together with the NP individuals in the parent population, wherein the new population is 2NP individuals, and realizing the update of the new population;
8) performing population updating operation, repeating the steps 6.2) to 6.3), and selecting front NP individuals as new populations;
9) setting G to G +1, and if G > G, executing step 10); otherwise, go to 7);
10) the first conformation in the G-th generation is output as the final predicted result.
This example illustrates protein 1BG8 with sequence length 76 as an example, a method for predicting protein structure based on residue contact map for multi-objective optimization, the method comprises the following steps:
1) inputting sequence information of a target protein, and predicting a residue Contact map of the target sequence by using a triplets server (zhanglab, ccmb, med, umich, edu/resttriplet), a metapsiccov server (bioif, cs, ucl, ac, uk/psipled), a RaptorX server (RaptorX, uchicago, edu/ContactMap) and a STOP-Contact server (spark-lab, org/server/shot-Contact);
2) acquiring fragment library files of 3 fragments and 9 fragments from a ROBETTA server (http:// www.robetta.org /) according to a target protein sequence;
3) according to the confidence coefficient between residue pairs in the residue contact map, arranging the confidence coefficient of the residue pairs from large to small, and selecting the first 2L pieces of contact information, wherein L is the sequence length of the target protein;
4) setting parameters: the population size NP is 200, the maximum number of iterations G is 500, the temperature factor β is 2, ωT=0.213,ωP=0.189,ωR0.233 and ωS0.175 represents the weight corresponding to the Contact energy calculated from the residue Contact maps predicted by the servers TripletRes, MetaPSICOV, RaptorX and STOP-Contact, respectively;
5) setting G ═ 1, G ∈ {1, 2.., G };
6) population initialization, the process is as follows:
6.1) generating 2NP initial constellations C by using the first and second stages of the Rosetta protocoln,n={1,2,…,2NP};
6.2) calculating the energy of each individual in the population using the Rosetta score3 energy function, Individual CnHas an energy of score3 (C)n) And calculating four contact energies E according to the residue contact maps predicted by the four different contact prediction serversC(Cn) And are respectively denoted as EC1(Cn),EC2(Cn),EC3(Cn) And EC4(Cn) In which EC(Cn) The calculation formula of (a) is as follows:
Figure BDA0002594070230000071
Figure BDA0002594070230000072
wherein N is the total number of residue pairs,
Figure BDA0002594070230000073
is the confidence of the contact of the kth residue in the residue contact map to the (i, j),
Figure BDA0002594070230000074
is the distance between the kth residue pair (i, j) in the target protein, dconIs a threshold value of 8, EC1(Cn) Representing contact energy calculated from residue contact maps predicted by the TripletRes server, EC2(Cn) Representing contact energy calculated from residue contact maps predicted by MetaPSICOV servers, EC3(Cn) Representing the calculated contact energy, E, from a residue contact map predicted by a Raptorx serverC4(Cn) Represents the Contact energy calculated from the residue Contact map predicted by the STOP-Contact server;
6.3) according to Etotal(Cn) Sequencing the individuals in the population from low to high, taking the first NP individuals as an initialization population, Etotal(Cn) The calculation formula of (a) is as follows:
Figure BDA0002594070230000081
Figure BDA0002594070230000082
Figure BDA0002594070230000083
Figure BDA0002594070230000084
Figure BDA0002594070230000085
7) conformational recombination, assembly, as follows:
7.1)Cnfor the nth individual in the population and selecting two conformations C in the population which are different from the target conformationn1,Cn2Three different fragments were randomly selected in these two conformations, replacing conformation CnFragment of the corresponding position in the sequence to generate a new recombinant conformation C'n
7.2) according to each conformation C 'in the starting population'nAssembly of fragments 3 and 9 to give a new conformation C ″n
7.3) if score3 (C'n)>score3(C″n) Then, the conformation C ″, is usednSubstitute for C'nOtherwise, receiving the new conformation by using the Boltzmann probability to generate new NP individuals of the child population, forming a new population together with the NP individuals in the parent population, wherein the new population is 2NP individuals, and realizing the update of the new population;
8) performing population updating operation, repeating the steps 6.2) to 6.3), and selecting front NP individuals as new populations;
9) setting G to G +1, and if G > G, executing step 10); otherwise, go to 7);
10) the first conformation in the G-th generation is output as the final predicted result.
Using the protein 1C8C with sequence length 76 as an example, the above method can obtain the near-native conformation of the protein, the conformation renewal scheme is shown in FIG. 1, and the root mean square deviation between the structure obtained after 500 generations of operation and the native structure is
Figure BDA0002594070230000091
The predicted three-dimensional structure is shown in fig. 2.
While the foregoing illustrates one embodiment of the invention showing advantageous results, it will be apparent that the invention is not limited to the above-described embodiment, but is capable of numerous modifications without departing from the basic inventive concepts and without exceeding the scope of the inventive concepts.

Claims (1)

1. A protein structure prediction method based on multi-objective optimization is characterized by comprising the following steps: the method comprises the following steps:
1) inputting sequence information of a target protein, and predicting a residue Contact graph of the target sequence by using a tripleters server, a MetaPSICOV server, a Raptorx server and a STOP-Contact server;
2) acquiring fragment library files of 3 fragments and 9 fragments from a ROBETTA server according to a target protein sequence;
3) according to the confidence coefficient between residue pairs in the residue contact map, arranging the confidence coefficient of the residue pairs from large to small, and selecting the first 2L pieces of contact information, wherein L is the sequence length of the target protein;
4) setting parameters: population size NP, maximum number of iterations G, temperature factor beta, omegaT,ωP,ωRAnd ωSRespectively representing the weights corresponding to Contact energies calculated from residue Contact maps predicted by the servers TripletRes, MetaPSICOV, RaptorX and STOP-Contact;
5) setting G ═ 1, G ∈ {1, 2.., G };
6) population initialization, the process is as follows:
6.1) generating 2NP initial constellations C by using the first and second stages of the Rosetta protocoln,n={1,2,…,2NP};
6.2) calculating the energy of each individual in the population using the Rosetta score3 energy function, Individual CnHas an energy of score3 (C)n) And calculating four contact energies E according to the residue contact maps predicted by the four different contact prediction serversC(Cn) And are respectively denoted as EC1(Cn),EC2(Cn),EC3(Cn) And EC4(Cn) In which EC(Cn) The calculation formula of (a) is as follows:
Figure FDA0002594070220000011
Figure FDA0002594070220000012
wherein N is the total number of residue pairs,
Figure FDA0002594070220000013
is the confidence of the contact of the kth residue in the residue contact map to the (i, j),
Figure FDA0002594070220000014
is the distance between the kth residue pair (i, j) in the target protein, dconIs a threshold value of 8, EC1(Cn) Representing contact energy calculated from residue contact maps predicted by the TripletRes server, EC2(Cn) Representing contact energy calculated from residue contact maps predicted by MetaPSICOV servers, EC3(Cn) Representing the calculated contact energy, E, from a residue contact map predicted by a Raptorx serverC4(Cn) Represents the Contact energy calculated from the residue Contact map predicted by the STOP-Contact server;
6.3) according to Etotal(Cn) Sequencing the individuals in the population from low to high, taking the first NP individuals as an initialization population, Etotal(Cn) The calculation formula of (a) is as follows:
Figure FDA0002594070220000021
Figure FDA0002594070220000022
Figure FDA0002594070220000023
Figure FDA0002594070220000024
7) conformational recombination, assembly, as follows:
7.1)Cnfor the nth individual in the population and selecting two conformations C in the population which are different from the target conformationn1,Cn2Three different fragments were randomly selected in these two conformations, replacing conformation CnFragment of the corresponding position in the sequence to generate a new recombinant conformation C'n
7.2) according to each conformation C 'in the starting population'nAssembly of fragments 3 and 9 to give a new conformation C ″n
7.3) if score3 (C'n)>score3(C″n) Then, the conformation C ″, is usednSubstitute for C'nOtherwise, receiving the new conformation by using the Boltzmann probability to generate new NP individuals of the child population, forming a new population together with the NP individuals in the parent population, wherein the new population is 2NP individuals, and realizing the update of the new population;
8) performing population updating operation, repeating the steps 6.2) to 6.3), and selecting front NP individuals as new populations;
9) setting G to G +1, and if G > G, executing step 10); otherwise, go to 7);
10) the first conformation in the G-th generation is output as the final predicted result.
CN202010704125.8A 2020-07-21 2020-07-21 Residue contact map-based multi-objective optimization protein structure prediction method Pending CN112085244A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010704125.8A CN112085244A (en) 2020-07-21 2020-07-21 Residue contact map-based multi-objective optimization protein structure prediction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010704125.8A CN112085244A (en) 2020-07-21 2020-07-21 Residue contact map-based multi-objective optimization protein structure prediction method

Publications (1)

Publication Number Publication Date
CN112085244A true CN112085244A (en) 2020-12-15

Family

ID=73735351

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010704125.8A Pending CN112085244A (en) 2020-07-21 2020-07-21 Residue contact map-based multi-objective optimization protein structure prediction method

Country Status (1)

Country Link
CN (1) CN112085244A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108846256A (en) * 2018-06-07 2018-11-20 浙江工业大学 A kind of group's Advances in protein structure prediction based on contact residues information
CN109033744A (en) * 2018-06-19 2018-12-18 浙江工业大学 A kind of Advances in protein structure prediction based on residue distance and contact information
CN109346128A (en) * 2018-08-01 2019-02-15 浙江工业大学 A kind of Advances in protein structure prediction based on residue information dynamic select strategy
CN110148437A (en) * 2019-04-16 2019-08-20 浙江工业大学 A kind of Advances in protein structure prediction that contact residues auxiliary strategy is adaptive
CN111180004A (en) * 2019-11-29 2020-05-19 浙江工业大学 Multi-contact information sub-population strategy protein structure prediction method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108846256A (en) * 2018-06-07 2018-11-20 浙江工业大学 A kind of group's Advances in protein structure prediction based on contact residues information
CN109033744A (en) * 2018-06-19 2018-12-18 浙江工业大学 A kind of Advances in protein structure prediction based on residue distance and contact information
CN109346128A (en) * 2018-08-01 2019-02-15 浙江工业大学 A kind of Advances in protein structure prediction based on residue information dynamic select strategy
CN110148437A (en) * 2019-04-16 2019-08-20 浙江工业大学 A kind of Advances in protein structure prediction that contact residues auxiliary strategy is adaptive
CN111180004A (en) * 2019-11-29 2020-05-19 浙江工业大学 Multi-contact information sub-population strategy protein structure prediction method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
李章维;余宝昆;胡俊;周晓根;张贵军;: "接触图辅助的过程重采样蛋白质构象空间优化算法", 小型微型计算机系统, no. 03, 15 March 2020 (2020-03-15) *
谢腾宇;周晓根;胡俊;张贵军;: "基于接触图残基对距离约束的蛋白质结构预测算法", 计算机科学, no. 01, 19 September 2019 (2019-09-19) *

Similar Documents

Publication Publication Date Title
Wang et al. Analysis of deep learning methods for blind protein contact prediction in CASP12
Purzycka et al. Automated 3D RNA structure prediction using the RNAComposer method for riboswitches1
CN106503484A (en) A kind of multistage differential evolution Advances in protein structure prediction that is estimated based on abstract convex
CN103473482B (en) Based on the prediction method for three-dimensional structure of protein that differential evolution and conformational space are annealed
CN108846256B (en) Group protein structure prediction method based on residue contact information
Escamilla-Gutiérrez et al. In silico strategies for modeling RNA aptamers and predicting binding sites of their molecular targets
CN109360596B (en) Protein conformation space optimization method based on differential evolution local disturbance
CN109360599B (en) Protein structure prediction method based on residue contact information cross strategy
CN106096326B (en) A kind of differential evolution Advances in protein structure prediction based on barycenter Mutation Strategy
CN109086566B (en) Group protein structure prediction method based on fragment resampling
CN109215732A (en) A kind of Advances in protein structure prediction based on contact residues information self study
Lo et al. Conformational epitope matching and prediction based on protein surface spiral features
CN109872770B (en) Variable strategy protein structure prediction method combined with displacement degree evaluation
CN106503486A (en) A kind of differential evolution protein structure ab initio prediction method based on multistage subgroup coevolution strategy
CN111180004B (en) Multi-contact information sub-population strategy protein structure prediction method
CN112085244A (en) Residue contact map-based multi-objective optimization protein structure prediction method
CN109360597B (en) Group protein structure prediction method based on global and local strategy cooperation
CN109378034B (en) Protein prediction method based on distance distribution estimation
CN109300506B (en) Protein structure prediction method based on specific distance constraint
CN109509510B (en) Protein structure prediction method based on multi-population ensemble variation strategy
CN109346126B (en) Adaptive protein structure prediction method of lower bound estimation strategy
CN109243526B (en) Protein structure prediction method based on specific fragment crossing
CN109147867B (en) Group protein structure prediction method based on dynamic segment length
CN109360600B (en) Protein structure prediction method based on residue characteristic distance
CN109448785B (en) Protein structure prediction method for enhancing Loop region structure by using Laplace graph

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination