CN107180164B - Template-based multi-domain protein structure assembly method - Google Patents
Template-based multi-domain protein structure assembly method Download PDFInfo
- Publication number
- CN107180164B CN107180164B CN201710256156.XA CN201710256156A CN107180164B CN 107180164 B CN107180164 B CN 107180164B CN 201710256156 A CN201710256156 A CN 201710256156A CN 107180164 B CN107180164 B CN 107180164B
- Authority
- CN
- China
- Prior art keywords
- domain protein
- protein
- domain
- template
- atom
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 34
- 108091005763 multidomain proteins Proteins 0.000 title claims abstract description 34
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 160
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 160
- 239000011159 matrix material Substances 0.000 claims abstract description 17
- 239000013598 vector Substances 0.000 claims description 12
- 230000003993 interaction Effects 0.000 claims description 6
- 230000008569 process Effects 0.000 abstract description 2
- 229940088594 vitamin Drugs 0.000 description 3
- 229930003231 vitamin Natural products 0.000 description 3
- 235000013343 vitamin Nutrition 0.000 description 3
- 239000011782 vitamin Substances 0.000 description 3
- 150000003722 vitamin derivatives Chemical class 0.000 description 3
- 230000008827 biological function Effects 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000003032 molecular docking Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 238000002441 X-ray diffraction Methods 0.000 description 1
- 125000003275 alpha amino acid group Chemical group 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
Landscapes
- Spectroscopy & Molecular Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Theoretical Computer Science (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Firstly, according to the structure of each domain protein, a protein structure comparison tool TM-align is utilized to compare each domain protein once, and the optimal template is found out from a multi-domain protein database; then, a Kabsch method is used for obtaining a rotation translation matrix, each domain protein is overlapped on the template, and translation rotation operation is carried out on each domain protein, so that the distance between the domain proteins is equal to the minimum allowable distance; secondly, the quality of the assembly structure is measured by adjusting the random translation and rotation of the assembly structure and utilizing the collision factors among the domain proteins, the number of atoms interacted with each other and the moving amplitude of the assembly structure relative to the template; in the assembly process, adjacent domain proteins are assembled in sequence, the assembled structures are fixed, and after all the structures are assembled, the final assembly result is output. The invention provides a template-based multi-domain protein structure assembly method with high prediction precision.
Description
Technical Field
The invention relates to the fields of biological informatics, intelligent optimization and computer application, in particular to a template-based multi-domain protein structure assembling method.
Background
Large-scale proteins are usually composed of multiple independently folded domain proteins, and the determination of the structure of multi-domain proteins can powerfully advance the progress of biological research. Domain proteins generally have a compact three-dimensional structure and specific biological functions, and the same domain proteins may have different binding domains. In addition, the single domain three-dimensional structure of many multi-domain proteins has been determined by X-ray diffraction, nuclear magnetic resonance, and computer predictions. Therefore, it is an important step to obtain the structure of the corresponding multi-domain protein according to the structure of the single-domain protein, and to determine the structure of the full-length protein and to understand the biological function of the full-length protein.
At present, there are two types of methods commonly used for predicting the structure of multi-domain proteins from single-domain proteins. The first type is assembled by fixing the structure of single domain proteins, followed by alignment. The second category assembles the structure of the entire multi-domain protein by enumerating the structures linking conformations between domain proteins. Among them, the first method can be regarded as the docking problem between proteins, and some docking methods can also be used for the assembly of multi-domain protein structures; unlike the first method, the second method can be regarded as a problem of de novo prediction of the structure of the relatively short amino acid sequence between the domain proteins, which has little sampling space due to the structure of the conformation between the linked domain proteins only being changed. However, the above method lacks template guidance, so that the assembly direction of the domain protein cannot be determined during assembly, thereby resulting in low prediction accuracy.
Therefore, the existing multi-domain protein structure assembly method has defects in prediction accuracy, and needs to be improved.
Disclosure of Invention
In order to overcome the defects of the existing multi-domain protein structure assembly method in the aspect of prediction precision, the invention provides a template-based multi-domain protein structure assembly method with higher prediction precision.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a method of template-based multi-domain protein structure assembly, the method comprising the steps of:
1) inputting the three-dimensional structure of each single-domain protein and the sequence information of the corresponding multi-domain protein;
2) setting the maximum number of iterations ImaxThreshold value d of collision distanceclashInteraction threshold dcontactConstant n number of atoms interacting0;
3) The following was performed for each multi-domain protein in the pdb (protein Data bank) library to determine the assembly template:
3.1) finding out the optimal alignment position of the first protein according to the protein structure alignment tool TM-align, and recording the template alignment score TM-score1;
3.2) finding the best alignment position for the second domain protein using TM-align starting from the last residue position of the first domain protein alignment and recording TM-score2;
3.3) repeating the step 3.2) to find the optimal alignment position of other domain proteins in turn, and recording TM-socre3,TM-socre4,…,TM-socreNN is the total number of domain proteins;
3.4) calculating the score of the templateWherein scoreiRepresents the score of the t-th template, TM-scoreiDenotes the alignment score, L, of the ith domain proteiniIs the sequence length of the ith domain protein;
3.5) calculating the score of each template through the steps 3.1) -3.4), and selecting the protein with the highest score as the template;
4) each domain protein was overlaid onto the template by the following procedure:
4.1) comparing the C α atom of the query protein with the C α atom of the template one by one, and then obtaining the result according to the Kabsch method
Rotation matrixAnd a translation vector (t)1,t2,t3),ustWhere s 1,2,3, t 1,2,3 denotes the t-th element of the s-th row of the rotation matrix, tsRepresents the s-th translation vector;
5) fixing the position of the nth domain protein, translating the nth +1 th domain protein according to the following formula to ensure that the root mean square deviation RMSD between the connection points is
Wherein lnThe length of the nth domain protein of vitamin,is the s-dimensional coordinate of the last C α atom of the nth domain protein,is the s-dimensional coordinate of the first C α atom of the n + 1-th domain protein, dn,n+1Is the euclidean distance between the last C α atom of the nth domain protein and the first C α atom of the n +1 th domain protein;
6) calculating the root mean square deviation E of the C α atom between the current protein and the templateRMSD;
7) Calculating Euclidean distance between C α atom of the nth domain protein and C α atom of the n +1 th domain protein, and counting the distance to be less than dclashNumber n ofclashAnd recording the corresponding distanceComputing a conflict score between domains
8) Counting that the distance in step 7 is less than dcontactNumber n ofcontactAnd calculating an interaction score
9) Calculating the energy E ═ w of the current protein1ERMSD+w2Eclash+w3EcontactWherein w is1,w2,w3Are the respective weight values;
10) iteratively determining the lowest energy assembly structure by:
10.1) determining the rotation axis:X3theta, where theta is 1-2rand[0,1],φ=2πrand[0,1],rand[0,1]A random decimal between 0 and 1;
10.2) randomly generating the rotation angle γ ═ 2rand [0,1 ═ 2rand]-1 and an assembly translation vector (T)1,T2,T3) Wherein T iss=0.3(2rand[0,1]-1),s=1,2,3;
10.3) determining an assembly rotation matrix:
wherein α is cos γ, β is sin γ, UstThe t-th element representing the s-th row of the assembled rotation matrix, s being 1,2,3, t being 1,2, 3;
10.4) rotation and translation operations on each C α atom of the n +1 th domain protein:
wherein,denotes the s-dimensional coordinate of the first C α atom of the n + 1-th domain protein, s ═ 1,2,3,
(ii) the s-dimensional coordinate representing the mth C α atom of the (n + 1) -th domain protein, s being 1,2, 3;
10.5) calculating the energy of the current assembly structure according to the steps 6) -9), and if the energy is reduced, accepting the current assembly structure;
11) repeating step 10) ImaxSecondly, the structure of the last time is an assembly structure of the nth domain protein and the (n + 1) th domain protein;
12) when the assembly of the N +1 th protein is finished, fixing the structure of the first N +1 domain protein, assembling the N +2 th domain protein according to the steps 5) -11), and outputting the final assembly structure until all the N domain proteins are assembled.
The technical conception of the invention is as follows: firstly, according to the structure of each domain protein, a protein structure comparison tool TM-align is utilized to compare each domain protein once, and an optimal template is found out from a multi-domain protein database; then, a Kabsch method is used for obtaining a rotation translation matrix, each domain protein is overlapped on the template, and translation rotation operation is carried out on each domain protein, so that the distance between the domain proteins is equal to the minimum allowable distance; secondly, the quality of the assembly structure is measured by adjusting the random translation and rotation of the assembly structure and utilizing the collision factors among the domain proteins, the number of atoms interacted with each other and the moving amplitude of the assembly structure relative to the template; in the assembly process, adjacent domain proteins are assembled in sequence, the assembled structures are fixed, and after all the structures are assembled, the final assembly result is output.
The beneficial effects of the invention are as follows: the assembly is guided through the template, the direction information of the assembled structure is obtained, and the quality of the assembled structure is measured according to the conflict factor and the interaction factor among the domain proteins and the change between the assembled structure and the template, so that the effect of guiding the assembly is achieved, and the prediction progress of the whole protein is improved.
Drawings
FIG. 1 is a flow diagram of a template-based multi-domain protein structure assembly method.
FIG. 2 is the result of the assembly of the multi-domain protein 3nd1A by the template-based multi-domain protein structure assembly method.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
Referring to fig. 1 and 2, a template-based multi-domain protein structure assembly method includes the steps of:
1) inputting the three-dimensional structure of each single-domain protein and the sequence information of the corresponding multi-domain protein;
2) setting the maximum number of iterations ImaxThreshold value d of collision distanceclashInteraction threshold dcontactConstant n number of atoms interacting0;
3) The following was performed for each multi-domain protein in the pdb (protein Data bank) library to determine the assembly template:
3.1) finding out the optimal alignment position of the first protein according to the protein structure alignment tool TM-align, and recording the template alignment score TM-score1;
3.2) finding the best alignment position for the second domain protein using TM-align starting from the last residue position of the first domain protein alignment and recording TM-score2;
3.3) repeating the step 3.2) to find the optimal alignment position of other domain proteins in turn, and recording TM-socre3,TM-socre4,…,TM-socreNN is the total number of domain proteins;
3.4) calculating the score of the templateWherein scoreiRepresents the score of the t-th template, TM-scoreiDenotes the alignment score, L, of the ith domain proteiniIs the sequence length of the ith domain protein;
3.5) calculating the score of each template through the steps 3.1) -3.4), and selecting the protein with the highest score as the template;
4) each domain protein was overlaid onto the template by the following procedure:
4.1) comparing C α atoms of the query protein with C α atoms of the template one by one, and then obtaining a rotation matrix according to a Kabsch methodAnd a translation vector (t)1,t2,t3),ustWhere s 1,2,3, t 1,2,3 denotes the t-th element of the s-th row of the rotation matrix, tsRepresents the s-th translation vector;
5) fixing the position of the nth domain protein, translating the nth +1 th domain protein according to the following formula to ensure that the root mean square deviation RMSD between the connection points is
Wherein lnThe length of the nth domain protein of vitamin,is the s-dimensional coordinate of the last C α atom of the nth domain protein,is the s-dimensional coordinate of the first C α atom of the n + 1-th domain protein, dn,n+1Is the euclidean distance between the last C α atom of the nth domain protein and the first C α atom of the n +1 th domain protein;
6) calculating the root mean square deviation E of the C α atom between the current protein and the templateRMSD;
7) Calculating Euclidean distance between C α atom of the nth domain protein and C α atom of the n +1 th domain protein, and counting the distance to be less than dclashNumber n ofclashAnd recording the corresponding distanceComputing a conflict score between domains
8) Counting that the distance in step 7 is less than dcontactNumber n ofcontactAnd calculating an interaction score
9) Calculating the energy E ═ w of the current protein1ERMSD+w2Eclash+w3EcontactWherein w is1,w2,w3Are the respective weight values;
10) iteratively determining the lowest energy assembly structure by:
10.1) determining the rotation axis:X3θ, wherein θ is 1-2rand [0,1 ═ θ],φ=2πrand[0,1],rand[0,1]A random decimal between 0 and 1;
10.2) randomly generating the rotation angle γ ═ 2rand [0,1 ═ 2rand]-1 and an assembly translation vector (T)1,T2,T3) Wherein T iss=0.3(2rand[0,1]-1),s=1,2,3;
10.3) determining an assembly rotation matrix:
wherein α is cos γ, β is sin γ, UstThe t-th element representing the s-th row of the assembled rotation matrix, s being 1,2,3, t being 1,2, 3;
10.4) rotation and translation operations on each C α atom of the n +1 th domain protein:
wherein,denotes the s-dimensional coordinate of the first C α atom of the n + 1-th domain protein, s ═ 1,2,3,
(ii) the s-dimensional coordinate representing the mth C α atom of the (n + 1) -th domain protein, s being 1,2, 3;
10.5) calculating the energy of the current assembly structure according to the steps 6) -9), and if the energy is reduced, accepting the current assembly structure;
11) repeating step 10) ImaxSecondly, the structure of the last time is an assembly structure of the nth domain protein and the (n + 1) th domain protein;
12) when the assembly of the N +1 th protein is finished, fixing the structure of the first N +1 domain protein, assembling the N +2 th domain protein according to the steps 5) -11), and outputting the final assembly structure until all the N domain proteins are assembled.
The multi-domain protein 3nd1A with the sequence length of 244 in the embodiment is an example, and the template-based multi-domain protein structure assembly method comprises the following steps:
1) inputting the three-dimensional structure of each single-domain protein and the sequence information of the corresponding multi-domain protein;
2) setting the maximum number of iterations Imax30000, collision distance threshold dclash3.75, interaction threshold dcontactNumber of atoms interacted constant n ═ 80=87;
3) The following procedure was performed for each multi-domain protein in the pdb (protein Data bank) library to determine the assembly template, as follows:
3.6) finding out the best alignment position of the first protein according to the protein structure alignment tool TM-align, and recording the template alignment score TM-score1;
3.7) finding the best alignment position for the second domain protein using TM-align starting from the last residue position of the first domain protein alignment and recording TM-score2;
3.8) repeating the step 3.2) to find the optimal alignment position of other domain proteins in turn, and recording TM-socre3,TM-socre4,…,TM-socreNN is the total number of domain proteins;
3.9) calculating the score of the templateWherein scoreiRepresents the score of the t-th template, TM-scoreiDenotes the alignment score, L, of the ith domain proteiniIs the sequence length of the ith domain protein;
3.10) calculating the score of each template through the steps 3.1) -3.4), and selecting the protein with the highest score as the template;
4) each domain protein was overlaid onto the template by the following procedure:
4.3) comparing C α atoms of the query protein with C α atoms of the template one by one, and then obtaining a rotation matrix according to a Kabsch methodAnd a translation vector (t)1,t2,t3),ustWhere s 1,2,3, t 1,2,3 denotes the t-th element of the s-th row of the rotation matrix, tsRepresents the s-th translation vector;
Wherein,(ii) an s-dimensional coordinate representing the mth C α atom of the nth domain protein;
5) fixing the position of the nth domain protein, translating the nth +1 th domain protein according to the following formula to ensure that the root mean square deviation RMSD between the connection points is
Wherein lnThe length of the nth domain protein of vitamin,is the s-dimensional coordinate of the last C α atom of the nth domain protein,is the s-dimensional coordinate of the first C α atom of the n + 1-th domain protein, dn,n+1Is the euclidean distance between the last C α atom of the nth domain protein and the first C α atom of the n +1 th domain protein;
6) calculating the root mean square deviation E of the C α atom between the current protein and the templateRMSD;
7) Calculating Euclidean distance between C α atom of the nth domain protein and C α atom of the n +1 th domain protein, and counting the distance to be less than dclashNumber n ofclashAnd recording the corresponding distanceComputing a conflict score between domains
8) Counting that the distance in step 7 is less than dcontactNumber n ofcontactAnd calculating an interaction score
9) Calculating the energy E ═ w of the current protein1ERMSD+w2Eclash+w3EcontactWherein w is1=w2=1,w30.35 is the weight value;
10) Iteratively determining the lowest energy assembly structure by:
10.1) determining the rotation axis:X3θ, wherein θ is 1-2rand [0,1 ═ θ],φ=2πrand[0,1],rand[0,1]A random decimal between 0 and 1;
10.2) randomly generating the rotation angle γ ═ 2rand [0,1 ═ 2rand]-1 and an assembly translation vector (T)1,T2,T3) Wherein T iss=0.3(2rand[0,1]-1),s=1,2,3;
10.3) determining an assembly rotation matrix:
wherein α is cos γ, β is sin γ, UstThe t-th element representing the s-th row of the assembled rotation matrix, s being 1,2,3, t being 1,2, 3;
10.4) rotation and translation operations on each C α atom of the n +1 th domain protein:
wherein,denotes the s-dimensional coordinate of the first C α atom of the n + 1-th domain protein, s ═ 1,2,3,
(ii) the s-dimensional coordinate representing the mth C α atom of the (n + 1) -th domain protein, s being 1,2, 3;
10.5) rotation and translation operations on each C α atom of the n +1 th domain protein:
wherein,represents the s-dimensional coordinate of the first C α atom of the n + 1-th domain protein,(ii) the s-dimensional coordinate representing the m-th C α atom of the n + 1-th domain protein;
10.6) calculating the energy of the current assembly structure according to the steps 6) -9), and if the energy is reduced, accepting the current assembly structure;
11) repeating step 10) ImaxSecondly, the structure of the last time is an assembly structure of the nth domain protein and the (n + 1) th domain protein;
12) when the assembly of the N +1 th protein is finished, fixing the structure of the first N +1 domain protein, assembling the N +2 th domain protein according to the steps 5) -11), and outputting the final assembly structure until all the N domain proteins are assembled.
Using the sequence length of 244 of the two-domain multi-domain protein 3nd1A as an example, the above method is used to assemble the near-native conformation of the multi-domain protein with the root mean square deviation ofTM-score was 0.997 and the predicted structure is shown in FIG. 2.
The above description is the optimization effect of the present invention using 3nd1A protein as an example, and is not intended to limit the scope of the present invention, and various modifications and improvements can be made without departing from the scope of the present invention.
Claims (1)
1. A template-based multi-domain protein structure assembly method, characterized in that: the multi-domain protein structure assembly comprises the following steps:
1) inputting the three-dimensional structure of each single-domain protein and the sequence information of the corresponding multi-domain protein;
2) setting the maximum number of iterations ImaxThreshold value d of collision distanceclashInteraction threshold dcontactConstant n number of atoms interacting0;
3) The following operations were performed for each multi-domain protein in the PDB library, thereby determining an assembly template:
3.1) finding out the optimal alignment position of the first protein according to the protein structure alignment tool TM-align, and recording the template alignment score TM-score1;
3.2) finding the best alignment position for the second domain protein using TM-align starting from the last residue position of the first domain protein alignment and recording TM-score2;
3.3) repeating the step 3.2) to find the optimal alignment position of other domain proteins in turn, and recording TM-socre3,TM-socre4,…,TM-socreNN is the total number of domain proteins;
3.4) calculating the score of the templateWherein scoreiRepresents the score of the ith template, TM-scoreiDenotes the alignment score, L, of the ith domain proteiniIs the sequence length of the ith domain protein;
3.5) calculating the score of each template through the steps 3.1) -3.4), and selecting the protein with the highest score as the template;
4) each domain protein was overlaid onto the template by the following procedure:
4.1) comparing C α atoms of the query protein with C α atoms of the template one by one, and then obtaining a rotation matrix according to a Kabsch methodAnd a translation vector (t)1,t2,t3),ustWhere s 1,2,3, t 1,2,3 denotes the t-th element of the s-th row of the rotation matrix, tsIs shown ass translation vectors;
Wherein,(ii) an s-dimensional coordinate representing the mth C α atom of the nth domain protein;
5) fixing the position of the nth domain protein, translating the nth +1 th domain protein according to the following formula to ensure that the root mean square deviation RMSD between the connection points is
Wherein lnIs the length of the nth domain protein,is the s-dimensional coordinate of the last C α atom of the nth domain protein,is the s-dimensional coordinate of the first C α atom of the n + 1-th domain protein, dn,n+1Is the euclidean distance between the last C α atom of the nth domain protein and the first C α atom of the n +1 th domain protein;
6) calculating the root mean square deviation E of the C α atom between the current protein and the templateRMSD;
7) Calculating Euclidean distance between C α atom in the nth domain protein and C α atom in the (n + 1) th domain protein, and countingDistance less than dclashNumber n ofclashAnd recording the corresponding distanceComputing a conflict score between domains
8) Counting that the distance in step 7 is less than dcontactNumber n ofcontactAnd calculating an interaction score
9) Calculating the energy E ═ w of the current protein1ERMSD+w2Eclash+w3EcontactWherein w is1,w2,w3Are the respective weight values;
10) iteratively determining the lowest energy assembly structure by:
10.1) determining the rotation axis:X3θ, wherein θ is 1-2rand [0,1 ═ θ],φ=2πrand[0,1],rand[0,1]A random decimal between 0 and 1;
10.2) randomly generating the rotation angle γ ═ 2rand [0,1 ═ 2rand]-1 and an assembly translation vector (T)1,T2,T3) Wherein T iss=0.3(2rand[0,1]-1),s=1,2,3;
10.3) determining an assembly rotation matrix:
wherein α is cos γ, β is sin γ, UstRepresenting the s-th row of the assembled rotation matrixThe number t of the elements is,
s=1,2,3,t=1,2,3;
10.4) rotation and translation operations on each C α atom of the n +1 th domain protein:
wherein,denotes the s-dimensional coordinate of the first C α atom of the n + 1-th domain protein, s ═ 1,2,3,(ii) the s-dimensional coordinate representing the mth C α atom of the (n + 1) -th domain protein, s being 1,2, 3;
10.5) calculating the energy of the current assembly structure according to the steps 6) -9), and if the energy is reduced, accepting the current assembly structure;
11) repeating step 10) ImaxSecondly, the structure of the last time is an assembly structure of the nth domain protein and the (n + 1) th domain protein;
12) when the assembly of the N +1 th protein is finished, fixing the structure of the first N +1 domain protein, assembling the N +2 th domain protein according to the steps 5) -11), and outputting the final assembly structure until all the N domain proteins are assembled.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710256156.XA CN107180164B (en) | 2017-04-19 | 2017-04-19 | Template-based multi-domain protein structure assembly method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710256156.XA CN107180164B (en) | 2017-04-19 | 2017-04-19 | Template-based multi-domain protein structure assembly method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107180164A CN107180164A (en) | 2017-09-19 |
CN107180164B true CN107180164B (en) | 2020-02-21 |
Family
ID=59831420
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710256156.XA Active CN107180164B (en) | 2017-04-19 | 2017-04-19 | Template-based multi-domain protein structure assembly method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107180164B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108062457B (en) * | 2018-01-15 | 2021-06-18 | 浙江工业大学 | Protein structure prediction method for structure feature vector auxiliary selection |
CN108763870B (en) * | 2018-05-09 | 2021-08-03 | 浙江工业大学 | Construction method of multi-domain protein Linker |
CN110164506B (en) * | 2019-04-19 | 2021-02-26 | 浙江工业大学 | Multi-domain protein structure assembly method based on inter-domain residue contact |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2049679A4 (en) * | 2006-06-28 | 2010-02-24 | P A Technologies Llc | Methods and materials for in vitro analysis and/or use of membrane-associated proteins, portions thereof or variants thereof |
CN103566363B (en) * | 2013-09-23 | 2015-12-09 | 中国人民解放军第三军医大学第一附属医院 | The preparation method of contraceptive microneedle |
CN105354441B (en) * | 2015-10-23 | 2018-02-02 | 上海交通大学 | The method for building phytoprotein interaction network |
CN105808972A (en) * | 2016-03-11 | 2016-07-27 | 浙江工业大学 | Method for predicting protein structure from local to global on basis of knowledge spectrum |
-
2017
- 2017-04-19 CN CN201710256156.XA patent/CN107180164B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN107180164A (en) | 2017-09-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Senior et al. | Improved protein structure prediction using potentials from deep learning | |
Ragoza et al. | Generating 3D molecules conditional on receptor binding sites with deep generative models | |
Nerli et al. | Chemical shift-based methods in NMR structure determination | |
CN107180164B (en) | Template-based multi-domain protein structure assembly method | |
Schneidman-Duhovny et al. | Modeling structure and dynamics of protein complexes with SAXS profiles | |
Bhattacharya et al. | UniCon3D: de novo protein structure prediction using united-residue conformational search via stepwise, probabilistic sampling | |
Chang et al. | AlphaFold encodes the principles to identify high affinity peptide binders | |
Al Nasr et al. | Building the initial chain of the proteins through de novo modeling of the cryo-electron microscopy volume data at the medium resolutions | |
Hunkler et al. | Back-mapping based sampling: Coarse grained free energy landscapes as a guideline for atomistic exploration | |
Pagès et al. | DeepSymmetry: using 3D convolutional networks for identification of tandem repeats and internal symmetries in protein structures | |
Fossi et al. | Quantitative study of the effects of chemical shift tolerances and rates of SA cooling on structure calculation from automatically assigned NOE data | |
Tuvi-Arad et al. | Improved algorithms for quantifying the near symmetry of proteins: complete side chains analysis | |
Moreta et al. | A probabilistic programming approach to protein structure superposition | |
Niklasson et al. | Fast and accurate resonance assignment of small-to-large proteins by combining automated and manual approaches | |
Li et al. | A stochastic point cloud sampling method for multi-template protein comparative modeling | |
CN110189794B (en) | Residue contact guided loop perturbation population protein structure prediction method | |
CN107609345B (en) | Multi-domain protein structure assembly method based on template self-adaptive selection | |
CN110164506B (en) | Multi-domain protein structure assembly method based on inter-domain residue contact | |
Katzberger et al. | A general graph neural network based implicit solvation model for organic molecules in water | |
Mirceva et al. | Protein classification by using four approaches for extraction of the protein ray-based descriptor | |
Al Nasr et al. | Geometry analysis for protein secondary structures matching problem | |
Tai et al. | SE: an algorithm for deriving sequence alignment from a pair of superimposed structures | |
DeBlasio et al. | Predicting core columns of protein multiple sequence alignments for improved parameter advising | |
US20230377690A1 (en) | Protein sequence and structure generation with denoising diffusion probabilistic models | |
Suryanto et al. | Combination of multiple distance measures for protein fold classification |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |