CN108334746B - Protein structure prediction method based on secondary structure similarity - Google Patents

Protein structure prediction method based on secondary structure similarity Download PDF

Info

Publication number
CN108334746B
CN108334746B CN201810034686.4A CN201810034686A CN108334746B CN 108334746 B CN108334746 B CN 108334746B CN 201810034686 A CN201810034686 A CN 201810034686A CN 108334746 B CN108334746 B CN 108334746B
Authority
CN
China
Prior art keywords
population
secondary structure
individual
individuals
similarity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810034686.4A
Other languages
Chinese (zh)
Other versions
CN108334746A (en
Inventor
李章维
孙科
余宝昆
马来发
周晓根
张贵军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Zhaoji Biotechnology Co ltd
Shenzhen Xinrui Gene Technology Co ltd
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN201810034686.4A priority Critical patent/CN108334746B/en
Publication of CN108334746A publication Critical patent/CN108334746A/en
Application granted granted Critical
Publication of CN108334746B publication Critical patent/CN108334746B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment

Landscapes

  • Spectroscopy & Molecular Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Theoretical Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A protein conformation space search method based on secondary structure similarity, firstly, the setting of cross probability in a group algorithm can not only control the speed of group convergence, avoid precocity, but also enable information interaction among groups; then, mutation can increase the diversity of conformation to obtain better conformation; and finally, in the selection process, the population is preferred by using the secondary structure similarity, individuals with small secondary structure similarity are eliminated, and better individuals are left, so that the problem of inaccuracy of an energy function is avoided. The invention has better sampling capability and higher prediction precision.

Description

Protein structure prediction method based on secondary structure similarity
Technical Field
The invention relates to the fields of biological informatics, molecular dynamics simulation, statistical learning and combination optimization and computer application, in particular to a protein structure prediction method based on secondary structure similarity.
Background
Since the end of the twentieth century, with the rapid development of the field of life sciences, more and more researchers and research institutes have participated in the research of life sciences. The protein is formed by dehydration and condensation of 20 different amino acids by taking mRNA as a template to form an amino acid sequence, and then is folded to form a three-dimensional structure with a specific function. The secondary structure of a protein refers to a regularly repeating conformation, such as alpha helices and beta sheets, in the polypeptide chain of the protein. The understanding of the three-dimensional structure of the protein is the basis for researching the biological function and activity mechanism of the protein, and the problem of protein structure prediction is one of the research hotspots in the fields of bioinformatics and computer application at present, and has very important guiding significance for the invention of new protein and the design of drug target protein. The three-dimensional structure of proteins is now determined experimentally mainly by methods such as X-ray diffraction and Nuclear Magnetic Resonance (NMR). The X-ray diffraction technique can obtain a protein structure with high precision, but the method is not suitable for the protein which can not prepare resolution crystals. The nuclear magnetic resonance method does not need to prepare protein crystals, but only can measure small proteins with less than 300 amino acids, and is long in time consumption and high in cost.
Since the rate of protein structure determination is much slower than that of sequence determination, in fact, only 0.2% of the protein sequences possess the experimentally determined protein structure, it is a very meaningful work to predict the structure from the protein sequence using computer methods. The Anfinsen experiments show that structural information of the protein is contained in its sequence, thereby indicating that structural prediction is feasible from the sequence. The mainstream protein structure prediction methods at present mainly comprise a homologous modeling method, a threading method and a de novo prediction method. For the case of high sequence identity (> 50%), homology modeling and threading are the first choice, but for the case of low sequence similarity (< 30%), the first two methods are no longer applicable, and only de novo prediction can be selected.
In the process of protein structure de novo prediction, two major bottlenecks exist at present, one is deceptiveness of energy landscape, so that the obtained energy low conformation is not a natural conformation, and is specifically represented as inaccurate energy function, and a good conformation cannot be selected; another is the lack of ability of the prior art to sample conformational space, which is manifested by a lack of diversity in the conformations.
Therefore, the current protein structure prediction method has defects in prediction accuracy and sampling capability, and needs to be improved.
Disclosure of Invention
In order to overcome the defects of insufficient sampling capability and prediction accuracy of the conventional protein structure prediction method, the invention provides a group protein structure prediction method based on secondary structure similarity, which has better sampling capability and higher prediction accuracy, designs a secondary structure similarity index, selects individuals with better energy and structure through the double constraints of the secondary structure similarity index and an energy function, and effectively solves the problem of low protein structure prediction accuracy caused by inaccuracy of the energy function.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a method for predicting protein structure based on secondary structure similarity, the method comprising the steps of: 1) setting parameters, and the process is as follows:
reading sequence information of a target protein, fragment library information, and an energy function, and setting population position { x } of a protein conformation1,x2,...,xi,...,xNPWhere NP is the population size, xiRepresenting the ith individual of the population, with Gen iteration number and G maximum iteration numbermaxCross probability p1Probability of variation p2The sequence length is L;
2) calculating the similarity of the secondary structure by the following process:
for a protein x of sequence length LiThe secondary structure of the prediction obtained by the PSIPRED online server is
Figure BDA0001547271050000021
k∈[1,L]The secondary structure of the conformation obtained by the Dssp algorithm is
Figure BDA0001547271050000022
k∈[1,L]According to the formula
Figure BDA0001547271050000023
Calculating to obtain the secondary structure similarity of the conformation, recording the score as 1 when the secondary structures of the k-th position and the k-th position are the same, otherwise, recording the score as 0, traversing the whole sequence length to obtain the final score, and dividing the final score by the sequence length to obtain the similarity SS of the secondary structurei
3) Population initialization, the process is as follows:
performing fragment assembly on individuals of the population until residue positions of all positions are replaced once, and finishing initialization operation;
4) population crossing, the process is as follows:
4.1) pairing NP individuals pairwise to form NP/2 pairs, and numbering the NP/2 pairs1,a2,...,aj,...,aNP2,j∈[1,NP/2];
4.2) randomly selecting one of the groups ajAccording to the probability p1Judging whether to carry out crossing, if so, randomly selecting a certain position of the individual, and setting the random crossing length as crosslength E [3,10 ]]Exchanging fragments of the two individuals to obtain two new individuals;
5) population variation, the process is as follows:
5.1) for individual xi,i∈[1,NP]Using the probability of variation p2Fragment assembly is carried out if random (0,1) > p2Then a fragment assembly of 3 fragments is performed, if random (0,1) < p2Then, segment assembly of 9 segments is carried out, and the individual x is obtained after segment assemblyi′;
5.2) Using energy function to group Individual x before and after AssemblyiAnd xi' separately calculating energy to obtain EiAnd Ei', if Ei′<EiThen the individual x is retainedi'; if Ei′>EiThen, according to Boltzmann probability p ═ exp { - (E)i′-Ei) KT }, to judge whether to receive the assembled individual, if random (0,1) < p, then keep the individual xi'; otherwise, individual x is retainedi
6) And (3) selecting the population by using the secondary structure similarity, wherein the process is as follows:
firstly, merging an initial population and a varied population into a new population with the population size of 2 × NP, then calculating the secondary structure similarity of individuals of the new population, sequencing the merged population according to the level of the secondary structure similarity, selecting the individuals with high primary NP secondary structure similarity as the selected population individuals, and finally setting Gen + 1;
7) judging whether the maximum iteration number G is reachedmaxIf the conditions are met, stopping iteration and outputting the last generation populationVolume information, otherwise, returning to the step 4).
The technical conception of the invention is as follows: the invention provides a protein structure prediction method based on secondary structure similarity under the framework of a group algorithm. Firstly, the setting of the cross probability in the group algorithm can control the speed of group convergence, avoid precocity and also enable information interaction between groups; then, mutation can increase the diversity of conformation to obtain better conformation; and finally, in the selection process, the population is preferred by using the secondary structure similarity, individuals with small secondary structure similarity are eliminated, and better individuals are left, so that the problem of inaccuracy of an energy function is avoided.
The beneficial effects of the invention are as follows: on one hand, a group algorithm is used, information interaction is carried out among groups, and search of a conformation space is increased; on the other hand, the population is selected through the secondary structure similarity, so that the retention probability of high-quality individuals is greatly increased, errors caused by inaccuracy of energy functions are reduced, and the prediction accuracy is improved.
Drawings
FIG. 1 is a flow chart of a method for predicting protein structure based on secondary structure similarity.
FIG. 2 is a conformational distribution diagram obtained when protein 1AIL is subjected to structure prediction by a protein structure prediction method based on secondary structure similarity.
FIG. 3 is a three-dimensional structural diagram obtained by predicting the structure of protein 1AIL by a protein structure prediction method based on secondary structure similarity.
The invention is further described below with reference to the accompanying drawings.
Referring to fig. 1 to 3, a method for predicting a protein structure based on secondary structure similarity, the method comprising the steps of:
1) setting parameters, and the process is as follows:
reading sequence information of a target protein, fragment library information, and an energy function, and setting population position { x } of a protein conformation1,x2,...,xi,...,xNPWhere NP is the population size, xiRepresenting a populationThe ith individual of (1), the number of iterations is Gen, and the maximum number of iterations GmaxCross probability p1Probability of variation p2The sequence length is L;
2) calculating the similarity of the secondary structure by the following process:
for a protein x of sequence length LiThe secondary structure of the prediction obtained by the PSIPRED online server is
Figure BDA0001547271050000041
k∈[1,L]The secondary structure of the conformation obtained by the Dssp algorithm is
Figure BDA0001547271050000042
k∈[1,L]According to the formula
Figure BDA0001547271050000043
Calculating to obtain the secondary structure similarity of the conformation, recording the score as 1 when the secondary structures of the k-th position and the k-th position are the same, otherwise, recording the score as 0, traversing the whole sequence length to obtain the final score, and dividing the final score by the sequence length to obtain the similarity SS of the secondary structurei
3) Population initialization, the process is as follows:
performing fragment assembly on individuals of the population until residue positions of all positions are replaced once, and finishing initialization operation;
4) population crossing, the process is as follows:
4.1) pairing NP individuals pairwise to form NP/2 pairs, and numbering the NP/2 pairs1,a2,...,aj,...,aNP/2,j∈[1,NP/2];
4.2) randomly selecting one of the groups ajAccording to the probability p1Judging whether to carry out crossing, if so, randomly selecting a certain position of the individual, and setting the random crossing length as crosslength E [3,10 ]]Exchanging fragments of the two individuals to obtain two new individuals;
5) population variation, the process is as follows:
5.1) for individual xi,i∈[1,NP]Using the probability of variation p2Fragment assembly is carried out if random (0,1) > p2Then a fragment assembly of 3 fragments is performed, if random (0,1) < p2Then, segment assembly of 9 segments is carried out, and the individual x is obtained after segment assemblyi′;
5.2) Using energy function to group Individual x before and after AssemblyiAnd xi' separately calculating energy to obtain EiAnd Ei', if Ei′<EiThen the individual x is retainedi'; if Ei′>EiThen, according to Boltzmann probability p ═ exp { - (E)i′-Ei) KT }, to judge whether to receive the assembled individual, if random (0,1) < p, then keep the individual xi'; otherwise, individual x is retainedi
6) And (3) selecting the population by using the secondary structure similarity, wherein the process is as follows:
firstly, merging an initial population and a varied population into a new population with the population size of 2 × NP, then calculating the secondary structure similarity of individuals of the new population, sequencing the merged population according to the level of the secondary structure similarity, selecting the individuals with high primary NP secondary structure similarity as the selected population individuals, and finally setting Gen + 1;
7) judging whether the maximum iteration number G is reachedmaxAnd if the conditions are met, stopping iteration and outputting the information of the population individuals of the last generation, otherwise, returning to the step 4).
In this embodiment, an α -sheet protein 1AIL with a sequence length of 73 is taken as an example, and a method for predicting a protein structure based on a secondary structure similarity includes the following steps:
1) setting parameters, and the process is as follows:
reading sequence information of a target protein, fragment library information, an energy function "score 3", and setting population position { x } of a protein conformation1,x2,...,xi,...,xNPWhere NP-100 is the population size, xiThe ith individual representing the population, with Gen iterations and maximum iterationsNumber G max100, cross probability p10.1, probability of variation p20.5, sequence length L73;
2) calculating the similarity of the secondary structure by the following process:
for a protein x of sequence length LiThe secondary structure of the prediction obtained by the PSIPRED online server is
Figure BDA0001547271050000061
k∈[1,L]The secondary structure of the conformation obtained by the Dssp algorithm is
Figure BDA0001547271050000062
k∈[1,L]According to the formula
Figure BDA0001547271050000063
Calculating to obtain the secondary structure similarity of the conformation, recording the score as 1 when the secondary structures of the k-th position and the k-th position are the same, otherwise, recording the score as 0, traversing the whole sequence length to obtain the final score, and dividing the final score by the sequence length to obtain the similarity SS of the secondary structurei
3) Population initialization, the process is as follows:
performing fragment assembly on individuals of the population until residue positions of all positions are replaced once, and finishing initialization operation;
4) population crossing, the process is as follows:
4.1) pairing NP individuals pairwise to form NP/2 pairs, and numbering the NP/2 pairs1,a2,...,aj,...,aNP/2,j∈[1,NP/2];
4.2) randomly selecting one of the groups ajAccording to the probability p1Judging whether to carry out crossing, if so, randomly selecting a certain position of the individual, and setting the random crossing length as crosslength E [3,10 ]]Exchanging fragments of the two individuals to obtain two new individuals;
5) population variation, the process is as follows:
5.1) for individual xi,i∈[1,NP]Using the probability of variation p2Fragment assembly is carried out if random (0,1) > p2Then a fragment assembly of 3 fragments is performed, if random (0,1) < p2Then, segment assembly of 9 segments is carried out, and the individual x is obtained after segment assemblyi′;
5.2) Using energy function to group Individual x before and after AssemblyiAnd xi' separately calculating energy to obtain EiAnd Ei', if Ei′<EiThen the individual x is retainedi'; if Ei′>EiThen, according to Boltzmann probability p ═ exp { - (E)i′-Ei) KT }, to judge whether to receive the assembled individual, if random (0,1) < p, then keep the individual xi'; otherwise, individual x is retainedi
6) And (3) selecting the population by using the secondary structure similarity, wherein the process is as follows:
firstly, merging an initial population and a varied population into a new population with the population size of 2 × NP, then calculating the secondary structure similarity of individuals of the new population, sequencing the merged population according to the level of the secondary structure similarity, selecting the individuals with high primary NP secondary structure similarity as the selected population individuals, and finally setting Gen + 1;
7) judging whether the maximum iteration number G is reachedmaxAnd if the conditions are met, stopping iteration and outputting the information of the population individuals of the last generation, otherwise, returning to the step 4).
Using the method described above, the protein was obtained in a near-native conformation using the alpha-folded protein 1AIL with a sequence length of 73, the minimum RMS deviation being
Figure BDA0001547271050000071
Mean root mean square deviation of
Figure BDA0001547271050000072
The prediction structure is shown in fig. 3.
The above description is the optimization effect of the present invention using 1AIL protein as an example, and is not intended to limit the scope of the present invention, and various modifications and improvements can be made without departing from the scope of the present invention.

Claims (1)

1. A protein structure prediction method based on secondary structure similarity is characterized by comprising the following steps:
1) setting parameters, and the process is as follows:
reading sequence information of a target protein, fragment library information, and an energy function, and setting population position { x } of a protein conformation1,x2,...,xi,...,xNPWhere NP is the population size, xiRepresenting the ith individual of the population, with Gen iteration number and G maximum iteration numbermaxCross probability p1Probability of variation p2The sequence length is L;
2) calculating the similarity of the secondary structure by the following process:
for a protein x of sequence length LiThe secondary structure of the prediction obtained by the PSIPRED online server is
Figure FDA0002980167240000011
The secondary structure of the conformation obtained by the Dssp algorithm is
Figure FDA0002980167240000012
According to the formula
Figure FDA0002980167240000013
Calculating to obtain the secondary structure similarity of the conformation, recording the score as 1 when the secondary structures of the k-th position and the k-th position are the same, otherwise, recording the score as 0, traversing the whole sequence length to obtain the final score, and dividing the final score by the sequence length to obtain the similarity SS of the secondary structurei
3) Population initialization, the process is as follows:
performing fragment assembly on individuals of the population until residue positions of all positions are replaced once, and finishing initialization operation;
4) population crossing, the process is as follows:
4.1) pairing NP individuals pairwise to form NP/2 pairs, and numbering the NP/2 pairs1,a2,...,aj,...,aNP/2,j∈[1,NP/2];
4.2) randomly selecting one of the groups ajAccording to the probability p1Judging whether to carry out crossing, if so, randomly selecting a certain position of the individual, and setting the random crossing length as crosslength E [3,10 ]]Exchanging fragments of the two individuals to obtain two new individuals;
5) population variation, the process is as follows:
5.1) for individual xi,i∈[1,NP]Using the probability of variation p2Fragment assembly is carried out if random (0,1) > p2Then a fragment assembly of 3 fragments is performed, if random (0,1) < p2Then, segment assembly of 9 segments is carried out, and the individual x is obtained after segment assemblyi′;
5.2) Using energy function to group Individual x before and after AssemblyiAnd xi' separately calculating energy to obtain EiAnd Ei', if Ei′<EiThen the individual x is retainedi'; if Ei′>EiThen, according to Boltzmann probability p ═ exp { - (E)i′-Ei) KT }, to judge whether to receive the assembled individual, if random (0,1) < p, then keep the individual xi'; otherwise, individual x is retainedi
6) And (3) selecting the population by using the secondary structure similarity, wherein the process is as follows:
firstly, merging an initial population and a varied population into a new population with the population size of 2 × NP, then calculating the secondary structure similarity of individuals of the new population, sequencing the merged population according to the level of the secondary structure similarity, selecting the individuals with high primary NP secondary structure similarity as the selected population individuals, and finally setting Gen + 1;
7) judging whether the maximum iteration number G is reachedmaxIf, ifStopping iteration and outputting the individual information of the last generation of population if the conditions are met, and returning to the step 4) if the conditions are not met.
CN201810034686.4A 2018-01-15 2018-01-15 Protein structure prediction method based on secondary structure similarity Active CN108334746B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810034686.4A CN108334746B (en) 2018-01-15 2018-01-15 Protein structure prediction method based on secondary structure similarity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810034686.4A CN108334746B (en) 2018-01-15 2018-01-15 Protein structure prediction method based on secondary structure similarity

Publications (2)

Publication Number Publication Date
CN108334746A CN108334746A (en) 2018-07-27
CN108334746B true CN108334746B (en) 2021-06-18

Family

ID=62924220

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810034686.4A Active CN108334746B (en) 2018-01-15 2018-01-15 Protein structure prediction method based on secondary structure similarity

Country Status (1)

Country Link
CN (1) CN108334746B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109346128B (en) * 2018-08-01 2021-06-18 浙江工业大学 Protein structure prediction method based on residue information dynamic selection strategy
CN109360597B (en) * 2018-08-01 2020-12-01 浙江工业大学 Group protein structure prediction method based on global and local strategy cooperation
CN109147867B (en) * 2018-08-28 2021-06-18 浙江工业大学 Group protein structure prediction method based on dynamic segment length
CN109326319B (en) * 2018-08-28 2021-05-18 浙江工业大学 Protein conformation space optimization method based on secondary structure knowledge
CN109300504B (en) * 2018-08-29 2021-05-18 浙江工业大学 Protein structure prediction method based on variable isoelite selection
CN109300505B (en) * 2018-08-29 2021-05-18 浙江工业大学 Protein structure prediction method based on biased sampling
CN110556161B (en) * 2019-08-13 2022-04-05 浙江工业大学 Protein structure prediction method based on conformational diversity sampling
CN110729023B (en) * 2019-08-29 2021-04-06 浙江工业大学 Protein structure prediction method based on contact assistance of secondary structure elements
CN111951885B (en) * 2020-08-11 2022-05-03 湖南大学 Protein structure prediction method based on local bias
CN113936737B (en) * 2021-10-14 2023-05-23 温州医科大学附属眼视光医院 Method for comparing RNA structures based on RNA motif vectors, family clustering method, method for evaluating allosteric effect, method for functional annotation, system and equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3023792A1 (en) * 2014-11-18 2016-05-25 Biosims Technologies Spot control
CN106599611A (en) * 2016-12-09 2017-04-26 中南大学 Marking method and system for protein functions
WO2017070160A1 (en) * 2015-10-20 2017-04-27 Georgetown University Systems and methods for in silico drug discovery
CN106778059A (en) * 2016-12-19 2017-05-31 浙江工业大学 A kind of colony's Advances in protein structure prediction based on Rosetta local enhancements
CN107506613A (en) * 2017-08-29 2017-12-22 浙江工业大学 A kind of multi-modal protein conformation space optimization method based on multiple structural features

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3023792A1 (en) * 2014-11-18 2016-05-25 Biosims Technologies Spot control
WO2017070160A1 (en) * 2015-10-20 2017-04-27 Georgetown University Systems and methods for in silico drug discovery
CN106599611A (en) * 2016-12-09 2017-04-26 中南大学 Marking method and system for protein functions
CN106778059A (en) * 2016-12-19 2017-05-31 浙江工业大学 A kind of colony's Advances in protein structure prediction based on Rosetta local enhancements
CN107506613A (en) * 2017-08-29 2017-12-22 浙江工业大学 A kind of multi-modal protein conformation space optimization method based on multiple structural features

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"Estimation of Position Specific Energy as a Feature of Protein Residues from Sequence Alone for Structural Classification";Sumaiya I;《Plos One》;20161002;第1-23页 *
"基于可持续更新模板库的蛋白质二级结构预测器";周鹏杰;《计算生物学》;20170622;第1-10页 *

Also Published As

Publication number Publication date
CN108334746A (en) 2018-07-27

Similar Documents

Publication Publication Date Title
CN108334746B (en) Protein structure prediction method based on secondary structure similarity
CN106778059B (en) A kind of group&#39;s Advances in protein structure prediction based on Rosetta local enhancement
CN108846256B (en) Group protein structure prediction method based on residue contact information
CN107633157B (en) Protein conformation space optimization method based on distribution estimation and copy exchange strategy
CN106055920B (en) It is a kind of based on the Advances in protein structure prediction that tactful copy exchanges more than stage
CN109033744B (en) Protein structure prediction method based on residue distance and contact information
CN110148437B (en) Residue contact auxiliary strategy self-adaptive protein structure prediction method
CN108062457B (en) Protein structure prediction method for structure feature vector auxiliary selection
CN109086566B (en) Group protein structure prediction method based on fragment resampling
CN109360599B (en) Protein structure prediction method based on residue contact information cross strategy
CN109872770B (en) Variable strategy protein structure prediction method combined with displacement degree evaluation
Sun et al. Smolign: a spatial motifs-based protein multiple structural alignment method
CN109360597B (en) Group protein structure prediction method based on global and local strategy cooperation
CN109346128B (en) Protein structure prediction method based on residue information dynamic selection strategy
CN109378034B (en) Protein prediction method based on distance distribution estimation
CN108763860B (en) Loop information sampling-based group protein conformation space optimization method
CN109033753B (en) Group protein structure prediction method based on secondary structure fragment assembly
CN108595910B (en) Group protein conformation space optimization method based on diversity index
CN106446601A (en) Method for labeling lncRNA functions in large scale
CN109326318B (en) Group protein structure prediction method based on Loop region Gaussian disturbance
CN109243526B (en) Protein structure prediction method based on specific fragment crossing
CN110729023B (en) Protein structure prediction method based on contact assistance of secondary structure elements
Mao et al. Identification of residue pairing in interacting β-strands from a predicted residue contact map
CN109147867B (en) Group protein structure prediction method based on dynamic segment length
CN109300505B (en) Protein structure prediction method based on biased sampling

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20221228

Address after: D1101, Building 4, Software Industry Base, No. 19, 17, 18, Haitian 1st Road, Binhai Community, Yuehai Street, Nanshan District, Shenzhen, Guangdong, 518000

Patentee after: Shenzhen Xinrui Gene Technology Co.,Ltd.

Address before: N2248, Floor 3, Xingguang Yingjing, No. 117, Shuiyin Road, Yuexiu District, Guangzhou, Guangdong 510,000

Patentee before: GUANGZHOU ZHAOJI BIOTECHNOLOGY CO.,LTD.

Effective date of registration: 20221228

Address after: N2248, Floor 3, Xingguang Yingjing, No. 117, Shuiyin Road, Yuexiu District, Guangzhou, Guangdong 510,000

Patentee after: GUANGZHOU ZHAOJI BIOTECHNOLOGY CO.,LTD.

Address before: The city Zhaohui six districts Chao Wang Road Hangzhou City, Zhejiang province 310014 18

Patentee before: JIANG University OF TECHNOLOGY