CN110556161B - Protein structure prediction method based on conformational diversity sampling - Google Patents

Protein structure prediction method based on conformational diversity sampling Download PDF

Info

Publication number
CN110556161B
CN110556161B CN201910743293.5A CN201910743293A CN110556161B CN 110556161 B CN110556161 B CN 110556161B CN 201910743293 A CN201910743293 A CN 201910743293A CN 110556161 B CN110556161 B CN 110556161B
Authority
CN
China
Prior art keywords
population
individuals
individual
diversity
protein
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910743293.5A
Other languages
Chinese (zh)
Other versions
CN110556161A (en
Inventor
张贵军
赵凯龙
饶亮
夏瑜豪
刘俊
彭春翔
周晓根
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Zhaoji Biotechnology Co ltd
Shenzhen Xinrui Gene Technology Co ltd
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN201910743293.5A priority Critical patent/CN110556161B/en
Publication of CN110556161A publication Critical patent/CN110556161A/en
Application granted granted Critical
Publication of CN110556161B publication Critical patent/CN110556161B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/12Computing arrangements based on biological models using genetic models
    • G06N3/126Evolutionary algorithms, e.g. genetic algorithms or genetic programming
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Theoretical Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Physiology (AREA)
  • Data Mining & Analysis (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computational Linguistics (AREA)
  • Biotechnology (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Peptides Or Proteins (AREA)

Abstract

A protein structure prediction method based on conformational diversity sampling predicts a protein structure by adopting a method of combining a genetic algorithm and a local search strategy, and samples population conformations by adopting an energy function and the diversity calculation method of the invention in the third and fourth stages of Rosetta. The conformation with larger difference of local structures is preferentially sampled, the prediction efficiency and precision are improved, and blind sampling is avoided to a certain extent. The invention provides a protein structure prediction method based on conformational diversity sampling, which has high prediction precision.

Description

Protein structure prediction method based on conformational diversity sampling
Technical Field
The invention relates to the fields of bioinformatics and computer application, in particular to a protein structure prediction method based on conformational diversity sampling.
Background
The problem of protein structure prediction is also known as the protein folding problem. The shape of the protein folding structure largely determines the biological function, and understanding the structural information of the protein is of great significance for studying the function of the protein. Prediction of the three-dimensional structure of proteins has become one of the important research issues in bioinformatics.
The de novo protein structure prediction method is a commonly used protein structure prediction method, and is also an ideal prediction method because it only uses primary sequence information for prediction and does not depend on a known protein structure template. The theoretical basis for de novo protein structure prediction is that the three-dimensional structure of the native protein in a certain environment is the structure with the least free energy of the whole system. Thus, there are two keys to de novo protein structure prediction: firstly, a reasonable potential function is required, and the global minimum point of the potential function corresponds to the natural structure of the protein; and secondly, an efficient conformational space search algorithm is required to ensure that the global minimum of the potential function is found in effective calculation time. In the process of de novo prediction of protein structure, the inaccuracy of energy function and the lack of sampling ability cause the prediction result to be not ideal.
Over the past decades, researchers have proposed many algorithms to solve the problem of searching for a globally optimal solution to the problem of predicting the three-dimensional structure of proteins. Genetic algorithms have long been used in protein structure prediction because of their ability to find optimal solutions simply and efficiently in large and complex search spaces. Because the genetic algorithm has the defects of easy falling into local optimum, premature phenomenon and slow convergence rate of the algorithm, most methods adopt a method of combining the genetic algorithm and a local search strategy to predict the protein structure. For example, a method combining a genetic algorithm and a simulated annealing algorithm can effectively avoid falling into a local optimal solution. The taboo algorithm is applied to the genetic algorithm, and the structure of the protein is quickly and accurately searched. However, these methods combine multiple algorithms, have long running time and low efficiency, and have certain limitations.
Disclosure of Invention
In order to overcome the defect of low sampling efficiency of the conventional protein structure prediction method, the invention provides a novel diversity calculation method. The invention adopts a method of combining a genetic algorithm and a local search strategy to predict the protein structure, and adopts an energy function to combine with the diversity calculation method of the invention to sample after the cross variation of population individuals. The diversity calculation method of the invention can avoid blindly sampling the protein conformation space. After the first-stage and second-stage fragment assembly of the Rosetta protocol is carried out, the general structure of the protein is predicted, and on the basis, the conformation with larger difference of local structures is preferentially sampled and then the fragment assembly is carried out, so that the algorithm is prevented from falling into local optimum, and the prediction efficiency and precision are improved.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a method for predicting protein structure based on conformational diversity sampling, the method comprising the steps of:
1) inputting sequence information of a predicted protein, and reading the sequence length L; setting parameters: population N, iteration number G, cross probability Pc
2) According to the sequence information of the target protein, a fragment library is constructed by utilizing Robeta (http:// robeta. bakerlab. org), and the secondary structure information of the target sequence is predicted by utilizing PSIPRED (http:// bioinf. cs. ucl. ac. uk/psiprd);
3) Iterating the first and second stages of Rosetta to generate an initial population with N individuals
Figure GDA0003267347870000021
4) Setting G to 0, wherein G belongs to {0,1,2, ·, G };
5) if g is 0, segment assembling is carried out on individuals in the population, and a new population P is generated1,P2,...,PN};
6) Respectively executing the steps 7) to 11) based on the third and fourth phases of the Rosetta protocol;
7) randomly pairing individuals in the population pairwise to form N/2 male parent pairs;
8) the cross operation, the process is as follows:
8.1) setting P1 *、P2 *Randomly selecting a loop area for two male parent individuals;
8.2) generating a random decimal r1,r1∈[0,1]If r is1<PcExchange P for1 *、P2 *All residues of the selected loop region are dihedral to generate two new individuals P'1、P′2
8.3) iterating steps 8.1) and 8.2) until all male parent pairs are crossed, generating a new population P '═ P'1,P′2,...P′N};
9) Mutation operation, the process is as follows:
9.1) to individuals P 'in the population P'iGenerating a random integer r2,r2∈[0,L-3]Randomly selecting a fragment from the corresponding 3-fragment library for replacement;
9.2) iterating step 9.1) until all individuals have completed variation, generating a new population P ″ ═ P ″1 ,P″2,...P″N};
10) Selecting operation, the process is as follows:
10.1) generating a random decimal rb,rb∈[0,1]If r isbIf the energy is less than 0.5, the individuals in the parent population P and the offspring population P' are scored by using an energy function, the individuals are sorted from low to high according to the energy, and the first N individuals with low energy are selected as the next generation population; otherwise, executing step 10.2);
10.2) the diversity of all individuals in the parent population P and the offspring population P' is calculated as follows:
diversity(Ci)=max{RMSEdif(Ci,Cj)|Cj∈{P∪P″},Ci≠Cj}
wherein
Figure GDA0003267347870000031
RMSE′atoRepresents an individual CiAnd individual CjThe amino acid sequence is 0-L/2 of the similarity of corresponding structures, RMSEatoRepresents an individual CiAnd individual CjSimilarity of corresponding structures of which the amino acid sequences are L/2-L;
Figure GDA0003267347870000032
C′iand C'jRepresents an individual CiAnd individual CjThe amino acid sequence is a structure corresponding to 0-L/2;
Figure GDA0003267347870000033
C″iand C ″)jRepresents an individual CiAnd individual CjThe amino acid sequence is a structure corresponding to L/2-L;
Figure GDA0003267347870000034
and
Figure GDA0003267347870000035
are respectively C'iAnd C'jThe three-dimensional coordinates of the ith atom in the population are determined, L is the sequence length of the structure, and finally, the individuals are sorted from high to low according to the diversity size, and the first N individuals with the maximum diversity are selected as the next generation of population;
11) g is G +1, if G is less than or equal to G, the step 7) is carried out, otherwise, the circulation is ended;
12) and outputting a prediction result.
The invention has the beneficial effects that: the protein structure is predicted by adopting a method of combining a genetic algorithm and a local search strategy, and sampling is carried out by combining an energy function and the diversity calculation method after the population individuals are subjected to cross variation. Individuals with large local structure difference are preferentially sampled, the prediction efficiency and precision are improved, and blind sampling is avoided to a certain extent.
Drawings
FIG. 1 is a schematic diagram of structure prediction of protein 1ELWA by a protein structure prediction method based on conformational diversity sampling.
FIG. 2 is a three-dimensional structural diagram of a protein 1ELWA based on a protein structure prediction method of conformational diversity sampling.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
Referring to fig. 1 and 2, a method for optimizing the individual space of a protein based on conformational diversity sampling comprises the following steps:
1) inputting the sequence information of the predicted protein,reading the sequence length L; setting parameters: population N, iteration number G, cross probability Pc
2) According to the sequence information of the target protein, a fragment library is constructed by utilizing Robeta (http:// robeta. bakerlab. org), and the secondary structure information of the target sequence is predicted by utilizing PSIPRED (http:// bioinf. cs. ucl. ac. uk/psiprd);
3) iterating the first and second stages of Rosetta to generate an initial population with N individuals
Figure GDA0003267347870000041
4) Setting G to 0, wherein G belongs to {0,1,2, ·, G };
5) if g is 0, segment assembling is carried out on individuals in the population, and a new population P is generated1,P2,...,PN};
6) Respectively executing the steps 7) to 11) based on the third and fourth phases of the Rosetta protocol;
7) Randomly pairing individuals in the population pairwise to form N/2 male parent pairs;
8) the cross operation, the process is as follows:
8.1) setting P1 *、P2 *Randomly selecting a loop area for two male parent individuals;
8.2) generating a random decimal r1,r1∈[0,1]If r is1<PcExchange P for1 *、P2 *All residues of the selected loop region are dihedral to generate two new individuals P'1、P′2
8.3) iterating steps 8.1) and 8.2) until all male parent pairs are crossed to complete to generate a new population
P′={P′1,P′2,...P′N};
9) Mutation operation, the process is as follows:
9.1) to individuals P 'in the population P'iGenerating a random integer r2,r2∈[0,L-3]Randomly selecting a fragment from the corresponding 3-fragment library for replacement;
9.2) iterating step 9.1) until all individuals have completed variation, generating a new population P ″ ═ P ″1 ,P″2,...P″N};
10) Selecting operation, the process is as follows:
10.1) generating a random decimal rb,rb∈[0,1]If r isbIf the energy is less than 0.5, the individuals in the parent population P and the offspring population P' are scored by using an energy function, the individuals are sorted from low to high according to the energy, and the first N individuals with low energy are selected as the next generation population; otherwise, executing step 10.2);
10.2) the diversity of all individuals in the parent population P and the offspring population P' is calculated as follows:
diversity(Ci)=max{RMSEdif(Ci,Cj)|Cj∈{P∪P″},Ci≠Cj}
wherein
Figure GDA0003267347870000051
RMSE′atoRepresents an individual C iAnd individual CjThe amino acid sequence is 0-L/2 of the similarity of corresponding structures, RMSEatoRepresents an individual CiAnd individual CjSimilarity of corresponding structures of which the amino acid sequences are L/2-L;
Figure GDA0003267347870000052
C′iand C'jRepresents an individual CiAnd individual CjThe amino acid sequence is a structure corresponding to 0-L/2;
Figure GDA0003267347870000053
C″iand C ″)jRepresents an individual CiAnd individual CjThe amino acid sequence is a structure corresponding to L/2-L;
Figure GDA0003267347870000054
and
Figure GDA0003267347870000055
respectively represent Ci' and Cj' the three-dimensional coordinates of the ith atom, L is the length of the sequence of the structure. Finally, sorting the individuals according to the diversity size from high to low, and selecting the first N individuals with the maximum diversity as a next generation population;
11) g is G +1, if G is less than or equal to G, the step 7) is carried out, otherwise, the circulation is ended;
12) and outputting a prediction result.
In this embodiment, taking protein 1ELWA with a sequence length of 117 as an example, a method for optimizing the individual space of protein based on conformational diversity sampling includes the following steps:
1) inputting sequence information of a predicted protein, setting parameters for reading sequence length L as 117: the population N is 100, the iteration number G is 10, and the cross probability P isc=0.5;
2) According to the sequence information of the target protein, a fragment library is constructed by utilizing Robeta (http:// robeta. bakerlab. org), and the secondary structure information of the target sequence is predicted by utilizing PSIPRED (http:// bioinf. cs. ucl. ac. uk/psiprd);
3) Iterating the first and second stages of Rosetta to generate an initial population with N individuals
Figure GDA0003267347870000056
4) Setting g to be 0;
5) if g is 0, segment assembling is carried out on individuals in the population, and a new population P is generated1,P2,...,PN};
6) Respectively executing the steps 7) to 11) based on the third and fourth phases of the Rosetta protocol;
7) randomly pairing individuals in the population pairwise to form N/2 male parent pairs;
8) the cross operation, the process is as follows:
8.1) setting P1 *、P2 *Is two fathersRandomly selecting a loop area for the individual;
8.2) generating a random decimal r1,r1∈[0,1]If r is1<PcExchange P for1 *、P2 *All residues of the selected loop region are dihedral to generate two new individuals P'1、P′2
8.3) iterating steps 8.1) and 8.2) until all male parent pairs are crossed, generating a new population P '═ P'1,P′2,...P′N};
9) Mutation operation, the process is as follows:
9.1) to individuals P 'in the population P'iGenerating a random integer r2,r2∈[0,L-3]Randomly selecting a fragment from the corresponding 3-fragment library for replacement;
9.2) iterating step 9.1) until all individuals have completed variation, generating a new population P ″ ═ P ″1,P″2,...P″N}; 10) Selecting operation, the process is as follows:
10.1) generating a random decimal rb,rb∈[0,1]If r isbLess than 0.5, using energy CiThe function scores the individuals in the parent population P and the offspring population P', the individuals are sorted from low to high according to energy, and the first N individuals with low energy are selected as the next generation population; otherwise, executing step 10.2);
10.2) the diversity of all individuals in the parent population P and the offspring population P' is calculated as follows:
diversity(Ci)=max{RMSEdif(Ci,Cj)|Cj∈{P∪P″},Ci≠Cj}
wherein
Figure GDA0003267347870000061
RMSE′atoRepresenting individuals and individuals CjThe amino acid sequence is 0-L/2 of the similarity of corresponding structures, RMSEatoRepresents an individual CiAnd individual CjThe amino acid sequence is L/2-L corresponds to structural similarity;
Figure GDA0003267347870000062
C′iand C'jRepresents an individual CiAnd individual CjThe amino acid sequence is a structure corresponding to 0-L/2;
Figure GDA0003267347870000063
C″iand C ″)jRepresents an individual CiAnd individual CjThe amino acid sequence is a structure corresponding to L/2-L;
Figure GDA0003267347870000071
and
Figure GDA0003267347870000072
respectively represent Ci' and Cj' the three-dimensional coordinates of the ith atom, L is the length of the sequence of the structure. Finally, sorting the individuals according to the diversity size from high to low, and selecting the first N individuals with the maximum diversity as a next generation population;
11) g is G +1, if G is less than or equal to G, the step 7) is carried out, otherwise, the circulation is ended;
12) and outputting a prediction result.
Using the protein 1ELWA having an amino acid sequence length of 117 as an example, a near-natural individual of the protein was obtained by the above method, and the predicted root mean square deviation of the protein was
Figure GDA0003267347870000073
As shown in fig. 1, the prediction structure is shown in fig. 2.
The foregoing is a predictive effect of one embodiment of the invention, which may be adapted not only to the above-described embodiment, but also to various modifications thereof without departing from the basic idea of the invention and without exceeding the gist of the invention.

Claims (1)

1. A method for optimizing the individual space of a protein based on conformational diversity sampling, the method comprising the steps of:
1) inputting sequence information of a predicted protein, and reading the sequence length L; setting parameters: population N, iteration number G, cross probability Pc
2) According to the sequence information of the target protein, a fragment library is constructed by utilizing Robeta, and the secondary structure information of the target sequence is predicted by utilizing PSIPRED;
3) iterating the first and second stages of Rosetta to generate an initial population with N individuals
Figure FDA0003458826350000011
4) Setting G to 0, wherein G belongs to {0,1,2, ·, G };
5) if g is 0, segment assembling is carried out on individuals in the population, and a new population P is generated1,P2,...,PN};
6) Respectively executing the steps 7) to 11) based on the third and fourth phases of the Rosetta protocol;
7) randomly pairing individuals in the population pairwise to form N/2 male parent pairs;
8) the cross operation, the process is as follows:
8.1) setting P1 *、P2 *Randomly selecting a loop area for two male parent individuals;
8.2) generating a random decimal r1,r1∈[0,1]If r is1<PcExchange P for1 *、P2 *All residues of the selected loop region are dihedral to generate two new individuals P'1、P′2
8.3) iterating steps 8.1) and 8.2) until all male parent pairs are crossed, generating a new population P '═ P' 1,P′2,...P′N};
9) Mutation operation, the process is as follows:
9.1) to individuals P 'in the population P'iGenerating a random integer r2,r2∈[0,L-3]Randomly selecting one fragment from the corresponding 3 residual fragment libraries for replacement;
9.2) iterating step 9.1) until all individuals have completed variation, generating a new population P ″ ═ P ″1,P″2,...P″N};
10) Selecting operation, the process is as follows:
10.1) generating a random decimal rb,rb∈[0,1]If r isbIf the energy is less than 0.5, the individuals in the parent population P and the offspring population P' are scored by using an energy function, the individuals are sorted from low to high according to the energy, and the first N individuals with low energy are selected as the next generation population; otherwise, executing step 10.2);
10.2) the diversity of all individuals in the parent population P and the offspring population P' is calculated as follows:
diversity(Ci)=max{RMSEdif(Ci,Cj)|Cj∈{P∪P″},Ci≠Cj}
wherein
Figure FDA0003458826350000021
RMSE′atoRepresents an individual CiAnd individual CjThe amino acid sequence is 0-L/2 of the similarity of corresponding structures, RMSEatoRepresents an individual CiAnd individual CjSimilarity of corresponding structures of which the amino acid sequences are L/2-L;
Figure FDA0003458826350000022
C′iand C'jRepresents an individual CiAnd individual CjThe amino acid sequence is a structure corresponding to 0-L/2;
Figure FDA0003458826350000023
C″iand C ″)jRepresents an individual CiAnd individual CjThe amino acid sequence is a structure corresponding to L/2-L;
Figure FDA0003458826350000024
and
Figure FDA0003458826350000025
are respectively C'iAnd C'jThe three-dimensional coordinates of the ith atom in the population are determined, L is the sequence length of the structure, and finally, the individuals are sorted from high to low according to the diversity size, and the first N individuals with the maximum diversity are selected as the next generation of population;
11) G is G +1, if G is less than or equal to G, the step 7) is carried out, otherwise, the circulation is ended;
and outputting a prediction result.
CN201910743293.5A 2019-08-13 2019-08-13 Protein structure prediction method based on conformational diversity sampling Active CN110556161B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910743293.5A CN110556161B (en) 2019-08-13 2019-08-13 Protein structure prediction method based on conformational diversity sampling

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910743293.5A CN110556161B (en) 2019-08-13 2019-08-13 Protein structure prediction method based on conformational diversity sampling

Publications (2)

Publication Number Publication Date
CN110556161A CN110556161A (en) 2019-12-10
CN110556161B true CN110556161B (en) 2022-04-05

Family

ID=68737340

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910743293.5A Active CN110556161B (en) 2019-08-13 2019-08-13 Protein structure prediction method based on conformational diversity sampling

Country Status (1)

Country Link
CN (1) CN110556161B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2018203197A1 (en) * 2010-07-23 2018-05-31 President And Fellows Of Harvard College Methods of detecting autoimmune or immune-related diseases or conditions
CN108334746A (en) * 2018-01-15 2018-07-27 浙江工业大学 A kind of Advances in protein structure prediction based on secondary structure similarity
CN108647486A (en) * 2018-03-22 2018-10-12 浙江工业大学 A kind of prediction method for three-dimensional structure of protein based on conformational diversity strategy
CN109411013A (en) * 2018-08-29 2019-03-01 浙江工业大学 A kind of group's Advances in protein structure prediction based on the specific Mutation Strategy of individual

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11574702B2 (en) * 2017-12-20 2023-02-07 George Mason University Mining all atom simulations for diagnosing and treating disease

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2018203197A1 (en) * 2010-07-23 2018-05-31 President And Fellows Of Harvard College Methods of detecting autoimmune or immune-related diseases or conditions
CN108334746A (en) * 2018-01-15 2018-07-27 浙江工业大学 A kind of Advances in protein structure prediction based on secondary structure similarity
CN108647486A (en) * 2018-03-22 2018-10-12 浙江工业大学 A kind of prediction method for three-dimensional structure of protein based on conformational diversity strategy
CN109411013A (en) * 2018-08-29 2019-03-01 浙江工业大学 A kind of group's Advances in protein structure prediction based on the specific Mutation Strategy of individual

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"Conformational diversity analysis reveals three functional mechanisms in proteins";Monzon A M;《PLOS Computational Biology》;20170213;第1-18页 *
"蛋白质结构从头预测多级个体筛选进化算法";李章维;《计算机科学》;20190630;第80-84页 *

Also Published As

Publication number Publication date
CN110556161A (en) 2019-12-10

Similar Documents

Publication Publication Date Title
Deng et al. Protein structure prediction
Xu et al. Toward optimal fragment generations for ab initio protein structure assembly
CN103473482B (en) Based on the prediction method for three-dimensional structure of protein that differential evolution and conformational space are annealed
CN108846256B (en) Group protein structure prediction method based on residue contact information
CN115458039B (en) Method and system for predicting single-sequence protein structure based on machine learning
CN109360599B (en) Protein structure prediction method based on residue contact information cross strategy
CN113593634B (en) Transcription factor binding site prediction method fusing DNA shape characteristics
CN104951670B (en) A kind of colony's conformational space optimization method based on distance spectrum
CN110556161B (en) Protein structure prediction method based on conformational diversity sampling
CN109378034B (en) Protein prediction method based on distance distribution estimation
CN108595910B (en) Group protein conformation space optimization method based on diversity index
CN109300506B (en) Protein structure prediction method based on specific distance constraint
Eghdami et al. BetaProbe: A probability based method for predicting beta sheet topology using integer programming
CN109300505B (en) Protein structure prediction method based on biased sampling
CN112967751A (en) Protein conformation space optimization method based on evolution search
CN109360600B (en) Protein structure prediction method based on residue characteristic distance
CN110444249B (en) Method for predicting fluorescent protein based on calculation
CN109147867B (en) Group protein structure prediction method based on dynamic segment length
CN109767808B (en) Novel protein evolution simulation model based on cellular automaton
Ahmad et al. Improved performance of lysine glutarylation PTM using peptide evolutionary features
CN109390035B (en) Protein conformation space optimization method based on local structure comparison
CN109243526B (en) Protein structure prediction method based on specific fragment crossing
CN108563921B (en) Protein structure prediction algorithm evaluation index construction method
JP2019095819A (en) Information processing device and program
CN109658979B (en) Protein structure prediction method based on fragment library information enhancement

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20231219

Address after: 518054, D1101, Building 4, Software Industry Base, No. 19, 17, and 18 Haitian 1st Road, Binhai Community, Yuehai Street, Nanshan District, Shenzhen City, Guangdong Province

Patentee after: Shenzhen Xinrui Gene Technology Co.,Ltd.

Address before: 510075 No. n2248, floor 3, Xingguang Yingjing, No. 117, Shuiyin Road, Yuexiu District, Guangzhou City, Guangdong Province

Patentee before: GUANGZHOU ZHAOJI BIOTECHNOLOGY CO.,LTD.

Effective date of registration: 20231219

Address after: 510075 No. n2248, floor 3, Xingguang Yingjing, No. 117, Shuiyin Road, Yuexiu District, Guangzhou City, Guangdong Province

Patentee after: GUANGZHOU ZHAOJI BIOTECHNOLOGY CO.,LTD.

Address before: The city Zhaohui six districts Chao Wang Road Hangzhou City, Zhejiang province 310014 18

Patentee before: JIANG University OF TECHNOLOGY