CN105260626A - Complete prediction method for protein structure spatial conformation - Google Patents
Complete prediction method for protein structure spatial conformation Download PDFInfo
- Publication number
- CN105260626A CN105260626A CN201510623583.8A CN201510623583A CN105260626A CN 105260626 A CN105260626 A CN 105260626A CN 201510623583 A CN201510623583 A CN 201510623583A CN 105260626 A CN105260626 A CN 105260626A
- Authority
- CN
- China
- Prior art keywords
- conformation
- protein
- frequency
- shape code
- amino acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 87
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 86
- 238000000034 method Methods 0.000 title claims abstract description 39
- 230000012846 protein folding Effects 0.000 claims abstract description 21
- 238000012360 testing method Methods 0.000 claims abstract description 7
- 150000001413 amino acids Chemical class 0.000 claims description 27
- 238000001228 spectrum Methods 0.000 claims description 10
- 239000013589 supplement Substances 0.000 claims description 8
- 239000000284 extract Substances 0.000 claims description 3
- 239000000047 product Substances 0.000 claims description 2
- 238000006467 substitution reaction Methods 0.000 claims description 2
- 102100036789 Protein TBATA Human genes 0.000 abstract 1
- 101710118245 Protein TBATA Proteins 0.000 abstract 1
- 238000013537 high throughput screening Methods 0.000 abstract 1
- 230000003595 spectral effect Effects 0.000 abstract 1
- 239000013598 vector Substances 0.000 description 6
- 241001505049 Balantiocheilos melanopterus Species 0.000 description 4
- 102000055006 Calcitonin Human genes 0.000 description 4
- 108060001064 Calcitonin Proteins 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- CWHJIJJSDGEHNS-MYLFLSLOSA-N Senegenin Chemical compound C1[C@H](O)[C@H](O)[C@@](C)(C(O)=O)[C@@H]2CC[C@@]3(C)C(CC[C@]4(CCC(C[C@H]44)(C)C)C(O)=O)=C4[C@@H](CCl)C[C@@H]3[C@]21C CWHJIJJSDGEHNS-MYLFLSLOSA-N 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- BBBFJLBPOGFECG-VJVYQDLKSA-N calcitonin Chemical compound N([C@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(N)=O)C(C)C)C(=O)[C@@H]1CSSC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1 BBBFJLBPOGFECG-VJVYQDLKSA-N 0.000 description 3
- 229960004015 calcitonin Drugs 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 239000009871 tenuigenin Substances 0.000 description 3
- 239000003814 drug Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 238000000455 protein structure prediction Methods 0.000 description 2
- 238000007619 statistical method Methods 0.000 description 2
- 101710134784 Agnoprotein Proteins 0.000 description 1
- 101000859758 Homo sapiens Cartilage-associated protein Proteins 0.000 description 1
- 101000916686 Homo sapiens Cytohesin-interacting protein Proteins 0.000 description 1
- 101000726740 Homo sapiens Homeobox protein cut-like 1 Proteins 0.000 description 1
- 101000761460 Homo sapiens Protein CASP Proteins 0.000 description 1
- 101000761459 Mesocricetus auratus Calcium-dependent serine proteinase Proteins 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 101710124584 Probable DNA-binding protein Proteins 0.000 description 1
- 102100024933 Protein CASP Human genes 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000001149 cognitive effect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
The invention relates to a complete prediction method for a protein structure spatial conformation, and belongs to the field of bioinformatics. For any protein sequence, a corresponding protein folding conformation can be obtained by adopting a protein structure fingerprint technique directly through high-throughput screening of a 5AAPFSC database. Each folding conformation is represented by protein folding shape code (PFSC) letters, and folding structures cover a secondary structure and a tertiary structure. All possible folding shape codes can be aligned to form an array, thereby generating a PFSC protein spatial conformation spectral band as a prediction result. Through tests of a large amount of known three-dimensional structured proteins, the reliability and effectiveness of the method are well verified.
Description
Technical field
The present invention relates to a kind of perfect information Forecasting Methodology of protein structure space conformation, belong to field of bioinformatics.
Background technology
Protein structure carries out genomics, bioinformatics, the important information of medicament research and development and biotechnology research
1,2.But, up to the present, only have and approximately measure acquisition less than the three-dimensional structure of the protein of 1% by experimental techniques such as X-ray crystallographic or nuclear magnetic resonance
3.Still the sequence about more than 52,000,000 protein is also had still not have information and the data of three-dimensional structure
4, biological medicine research urgently wishes the space structure determining these albumen.For a long time, based on microcomputer modelling, many methods and applications about protein structure prediction are developed.From 1994, " key evaluation (CASP) of the protein structure prediction " activity every two years held once became an intercommunion platform of countries in the world protein molecule bio-science man
5,6.In view of the complicacy of protein structure, and the possible folding mode of exponential number level, one of research puzzle 100 large challenge subjects being listed in 21 century modern science of predicted protein structure
7.
Up to now, the method for various predicted protein structure can be divided into three major types other substantially.The first kind is the modeling method based on sequence
8,9,10.The method utilizes known protein structure to solve agnoprotein structure.This method needs the similarity degree comparison information extraction depended between sequence, is a query about the degree of reliability predicted the outcome always.Equations of The Second Kind is the splicing modeling method adopting folding configuration identification
11,12,13,14,15.The method employing statistical method screens the mutual relationship between fold segments and sequence from specific albumen database.Statistical method no doubt can contain most of folding configuration, but the lower folding configuration of frequency has often just been left in the basket.3rd class is ab iitio model method
16,17,18.The method uses the interaction between the amino acid of computing machine repeatedly in iterative computation protein and atom, and to the last whole conformation system is tending towards a lower energy state.The computer time that the method consumption is a large amount of and resource, and prediction only can obtain a possible space structure of related protein.For a long time, biologist expects to obtain protein structure that is reliable and that have no objection by Forecasting Methodology.As target, the Forecasting Methodology improving protein structure is attempted in various research, but progress is in this respect very undesirable.Searching to the bottom reason, is the complicacy due to protein structure itself and polytrope.
Summary of the invention
Technical matters to be solved by this invention is to provide a kind of perfect information prediction (CompletePredictionforProteinConformation, CPPC) method of protein conformation.The method uses digital model to simplify the complicacy of protein structure, uses perfect information structured data to carry out the polytrope of cognitive protein structure simultaneously.The method can the structure of fast prediction protein, and provides all possible protein steric conformation.
The perfect information Forecasting Methodology of protein conformation of the present invention be based upon inventor before the new method of the predicted protein structure of exploitation on PFSC (ProteinFoldingShapeCode, PFSC) basis disclosed in patent ZL200880003164.2
19.The PFSC (PFSC) obtained by strictly deriving can describe the collapsed shape of continuous print 5 amino acid fragments in good condition.Any collapsed shape of 5 amino acid fragments in protein can be described by 27PFSC vector, and whole 27PFSC vector have employed 26 English alphabets and adds $ symbols.The more important thing is, whole 27 PFSC vector covers a complete mathematical space.And the collapsed shape of whole 27 PFSC vectors is highly closely related.Each PFSC vector can be another vector from a vectorial transition and conversion.
From mathematical angle amino acid, by different sequences, 5 amino acid can form different arrangements.From whole 20 amino acid, at random extract 5 amino acid can be formed and add up to 3,200, the difference arrangement of 000.The possible folded conformation of each arrangement can obtain from global Protein Data Bank (PDB), then represents with protein folding shape code (PFSC).On this basis, we create a database to collect the folded conformation of above-mentioned 3,200,000 arrangements.This brand-new database is named as 5AAPFSC.In this database, the collapsed shape relevant with each arrangement stores intactly adopting corresponding PFSC code wherein.
The perfect information Forecasting Methodology of protein structure space conformation of the present invention, comprises the steps:
1) from whole 20 amino acid, 5 amino acid are at random extracted, formation adds up to 3, the difference arrangement of 200,000, the possible folded conformation of each arrangement obtains from global Protein Data Bank (PDB), then represents with protein folding shape code (PFSC); Create a database to collect the protein folding shape code of above-mentioned arrangement and correspondence thereof, this database is named as 5AAPFSC, as shown in Figure 1;
2) for the protein of any one structure to be predicted, along the sequence of protein, from N-end, progressively move to C-end, read every 5 continuous print amino acid successively, its folded conformation that may have directly obtains from 5AAPFSC database, with the character representation of protein folding shape code (PFSC); In Protein Data Bank, the character of the folded conformation code that the frequency of occurrences is the highest makes number one, the folded conformation code character that the frequency of occurrences second is high comes second, form row successively from top to bottom, till collection completely, every 5 continuous print amino acid have the folded conformation possibility of different number;
3) the whole possible collapsed shape code of testing protein forms an array, is called protein folding conformation bands of a spectrum, as shown in Figure 2, represents along all possible folded conformation of the sequence of protein; For each protein sequence, by the phase trans-substitution of its all possible partial folds conformation, all possible conformation can be obtained exactly; The total number of possible conformation is the continued product of all every 5 amino acid possibility folded conformation numbers;
For any one testing protein, although all the number of possible space conformation is huge, the space conformation that possibility is high is obtained by the local folded conformation that the frequency of occurrences is high.For example, first space conformation is made up of the folded configuration shape code that the frequency of occurrences is the highest; Second space conformation is by the high collapsed shape code of the frequency of occurrences second, is not having the second high-frequency conformation position, with frequency the highest collapsed shape code formed as a supplement; 3rd space conformation is by frequency of occurrences third high collapsed shape code, is not having third high frequency conformation position, with frequency the highest collapsed shape code formed as a supplement; So analogize, form a series of possible prediction conformation that possibility is higher.
Therefore, a succession of protein folding shape code be made up of high-frequency conformation is exactly the higher protein steric structural conformation of possibility.According to protein folding conformation bands of a spectrum, the change of more local can be found and substitute, carrying out revising formation more about possible space structure conformation.
The protein steric conformation bands of a spectrum that this analysis method obtains provide the prediction of a perfect information to protein structure space folding conformation, disclose the subtle change of its possible any local conformation simultaneously.A significance of perfect information prediction (CPPC) method of protein conformation will create necessary condition for building a brand-new protein gene structure composition database from now on.The perfect information prediction of protein conformation is the new method of of predicted protein structure, and the method will promote the development of protein structure genomics.The perfect information forecasting techniques of the protein conformation of our exploitation not only provides complete folded conformation to the prediction of protein structure, and is of great significance for the protein structure that complete understanding obtains from measuring.
Accompanying drawing explanation
The structure of Fig. 1,5AAPFSC database.
Fig. 2, the foundation of protein folding conformation bands of a spectrum.
The contrast that 2XCW protein fragments (residue 3-62) the known conformation of Fig. 3, people's tenuigenin 5'-nucleotide II albumen and perfect information predict the outcome.Form the first row is the amino acid sequence segments (3-62) of this albumen.Then be the folded conformation of 8 known structure, folded conformation protein folding shape code (PFSC) represents.In form, lower semisection is 9 possible space conformations of prediction.
Fig. 4, marine aquatic biological silver shark (CallorhinchusMiliiX) extract 32 amino acid whose calcitonin prediction space conformation.
Embodiment
For any protein sequence, use protein structure fingerprint technique (PSFT), directly by the high flux screening to 5AAPFSC database, the protein folding conformation of its correspondence will be obtained.Each folded conformation will be represented by protein folding shape code (PFSC) letter, and wherein each letter represents the characteristic of its proprietary foldable structure, and these foldable structures cover secondary structure and tertiary structure.All possible collapsed shape code can align formation array, generates PFSC protein steric conformation bands of a spectrum as predicting the outcome.By the test of the protein to a large amount of known three-dimensional structure, demonstrate reliability and the validity of the method well.
Embodiment one selects one to know, and the protein of three-dimensional structure contrasts with predicting the outcome as an example.
People's tenuigenin 5'-nucleotide II protein is a protein molecular with known three-dimensional structure, and its three-dimensional structure is determined by the experiment of X-ray crystallographic.The first half of Fig. 3 lists the space conformation of testing people's tenuigenin 5'-nucleotide II protein 8 structures recorded from X-ray crystallographic.Its each three-D space structure can obtain from Protein Data Bank.Then, each conformation folded code is expressed, and array is lined up in alignment.The configuration state recorded is tested in the representative of each space conformation.The Lower Half of Fig. 3 is listed the inventive method and is predicted 9 the most probable space conformations obtained.The step that these space conformations can be described by earlier paragraphs obtains.First space conformation is made up of the character string of the highest folded conformation code of the frequency of occurrences; Second space conformation is high by the frequency of occurrences second, add form frequency the highest the character string of folded conformation code form as a supplement; 3rd space conformation is by frequency of occurrences third high, add form frequency the highest the character string of folded conformation code form as a supplement; So analogize, form 9 conformations that possibility is higher.As can be seen from the table, if with the known conformation of 2XCW protein fragments 60 amino acid sequences (3-62) for reference, the result extracting the first row prediction is compared, and the result conformation of perfect information prediction has 45 folded conformations identical, 5 similar, 10 differences.Only consider predicting the outcome of the first row, so-called rate of accuracy reached is to about 80%.
On the other hand, molecular biologist is recognized, tests from X-ray crystallographic some static structure states that the structured data recorded is protein, can not reflect that the whole of protein may dynamic conformational.Protein 8 known spatial conformations that table one upper semisection is listed, these space conformations show the changeability of its structure.Contrast these changes, the prediction bands of a spectrum of perfect information prediction can contain the folding configuration of these changes completely.The data of form effectively illustrate that the perfect information forecasting techniques of the protein conformation that we develop not only provides complete folded conformation to the prediction of protein structure, and are of great significance for the protein structure that complete understanding obtains from measuring.
Embodiment two selects the albumen of a unknown three-D space structure as an example, and its three-dimensional conformation can be obtained by holoprotein information prediction.Fig. 4 illustrates and predicts from the space conformation of the calcitonin polypeptide of marine aquatic biological silver shark (CallorhinchusMiliiX) 32 Amino acid profiles.4 steps that these space conformations can be described by earlier paragraphs obtain.First space conformation is made up of the character string of the highest folded conformation code of the frequency of occurrences; Second space conformation is high by the frequency of occurrences second, add form frequency the highest the character string of folded conformation code form as a supplement; 3rd space conformation is by frequency of occurrences third high, add form frequency the highest the character string of folded conformation code form as a supplement; So analogize, form 13 conformations that possibility is higher.Predict that the protein steric conformation bands of a spectrum of the silver shark calcitonin obtained are made up of 13 protein folding shape code (PFSC) character codes.These bands of a spectrum are the complete prediction to silver shark calcitonin space conformation, and illustrate may changing of its local conformation.
Perfect information prediction (CPPC) method of protein conformation of the present invention has following four characteristics and breakthrough.
1. protein conformation perfect information prediction (CPPC) based on tight mathematical derivation and and protein structure feature combine.First, 27PFSC protein folding shape code intactly represents the complete closure space that has essential meaning, and this ensures that thering predicts the outcome can not produce disappearance and omit.On 5 amino acid bases, by setting up 20 amino acid and and 27 PFSC protein folding shape code correlativitys.Combining global albumen database, the feature of the protein structure that is closely connected, creates 5AAPFSC database, enumerates any 5 amino acid whose all possible mathematics arrangements in 20 amino acid.For traditional protein tertiary structure method, the new method of the CPPC set up according to the correlativity between these arrangement and PFSC codes has solid mathematics basis.
2. protein conformation perfect information prediction (CPPC) provide fast prediction protein structure by way of.According to current computer technology level, if every 10
-13second calculates a conformation.For 100 amino acid whose protein sequences, if allow each amino-acid variants 10 locus, sum 10 will be produced
100space conformation.Complete these conformations and need 10
77year completes.For onesize protein sequence, use perfect information prediction (CPPC) technology of protein conformation, only need about 30 second time.Perfect information prediction (CPPC) of protein conformation, for the structure prediction completing the thousands of protein sequence of length, also only needs about 120 seconds.
3. the whole possible local that perfect information prediction (CPPC) of protein conformation is shown along protein sequence by PFSC protein folding shape code folds change.These locals fold change and combine the space conformation that can form exponential total quantity.The information of these magnanimity is exposed in perfect information prediction conformation bands of a spectrum completely.
4. perfect information prediction (CPPC) of protein conformation can predict possible protein conformation.From the space conformation of enormous quantity, possible space conformation is doped according to the local folded conformation frequency of occurrences.
List of references:
------------
1Jumpupto:abPSIAssessmentPanel."ReportoftheProteinStructureInitiativeAssessmentPanel".RetrievedDecember5,2008
2Baker,D.;Sali,A.(Oct2001)."Proteinstructurepredictionandstructuralgenomics",Science294(5540):93–6
3Yonath,Ada.X-raycrystallographyattheheartoflifescience.CurrentOpinioninStructuralBiology.Volume21,Issue5,October2011,Pages622–626.
4Rigden,DanielJ.FromProteinStructuretoFunctionwithBioinformatics.SpringerScience.2009.ISBN978-1-4020-9057-8.
5MoultJ.etal.Alarge-scaleexperimenttoassessproteinstructurepredictionmethods,1995;Proteins23
6http://predictioncenter.org
7Jumpup,Editorial:Somuchmoretoknow.Science2005,309:78-102
8ZhangY(2008)."Progressandchallengesinproteinstructureprediction".CurrOpinStructBiol18(3):342–8.
9YiHea,S.Rackovskya,YanpingYina,andHarold.Scheragaa,Alternativeapproachtoproteinstructurepredictionbasedonsequentialsimilarityofphysicalproperties,PNAS,2015,112(16):5029-5032
10Ashtawy,H.M.;Mahapatra,N.R.,"AComparativeAssessmentofPredictiveAccuraciesofConventionalandMachineLearningScoringFunctionsforProtein-LigandBindingAffinityPrediction,"ComputationalBiologyandBioinformatics,IEEE/ACMTransactionson,vol.12,no.2,pp.335,347,2015
11BowieJU,LuthyR,EisenbergD;Lüthy;Eisenberg(1991)."Amethodtoidentifyproteinsequencesthatfoldintoaknownthree-dimensionalstructure".Science253(5016):164–170.
12JT.Huang,TWang,SR.HuangandXLi,Reducedalphabetforproteinfoldingprediction,Proteins,2015,83-4,631–63
13BowieJU,LüthyR,EisenbergD(1991)."Amethodtoidentifyproteinsequencesthatfoldintoaknownthree-dimensionalstructure".Science253(5016):164–170.
14JonesDT,TaylorWR,ThorntonJM(1992)."Anewapproachtoproteinfoldrecognition".Nature358(6381):86–89..
15Peng,Jian;JinboXu(2011)."RaptorX:exploitingstructureinformationforproteinalignmentbystatisticalinference".Proteins.79Suppl10:
16Pierce,LeviC.T.;Salomon-Ferrer,Romelia;AugustoF.deOliveira,Cesar;McCammon,J.Andrew;Walker,RossC.(2012)."RoutineAccesstoMillisecondTimeScaleEventswithAcceleratedMolecularDynamics".JournalofChemicalTheoryandComputation8(9):2997–3002.
17Nugent,T.;Jones,D.T.(2012)."Accuratedenovostructurepredictionoflargetransmembraneproteindomainsusingfragment-assemblyandcorrelatedmutationanalysis".ProcNatlAcadSciUSA109(24):E1540–7.
18Morcos,F.;Pagnani,A.;Lunt,B.;Bertolino,A.;Marks,DS.;Sander,C.;Zecchina,R.;Onuchic,JN.etal.(Dec2011)."Direct-couplinganalysisofresiduecoevolutioncaptures
---------
nativecontactsacrossmanyproteinfamilies".ProcNatlAcadSciUSA108(49):E1293–301.
19YangJ.Comprehensivedescriptionofproteinstructuresusingproteinfoldingshapecode.Proteins2008;71.3:1497-1518。
Claims (4)
1. a perfect information Forecasting Methodology for protein structure space conformation, is characterized in that, comprise the steps:
1) from whole 20 amino acid, at random extract 5 amino acid, formed and add up to 3,200, the difference arrangement of 000, the possible folded conformation of each arrangement obtains from global Protein Data Bank, then represents with protein folding shape code; Create a database to collect the protein folding shape code of above-mentioned arrangement and correspondence thereof, this database is named as 5AAPFSC;
2) for the protein of any one structure to be predicted, along the sequence of protein, from N-end, progressively move to C-end, read every 5 continuous print amino acid successively, its folded conformation that may have directly obtains from 5AAPFSC database, with the character representation of protein folding shape code; In Protein Data Bank, the character of the folded conformation code that the frequency of occurrences is the highest makes number one, the folded conformation code character that the frequency of occurrences second is high comes second, form row successively from top to bottom, till collection completely, every 5 continuous print amino acid have the folded conformation possibility of different number;
3) the whole possible collapsed shape code of testing protein forms an array, is called protein folding conformation bands of a spectrum, represents along all possible folded conformation of the sequence of protein; For each protein sequence, by the phase trans-substitution of its all possible partial folds conformation, all possible conformation can be obtained exactly; The total number of possible conformation is the continued product of all every 5 amino acid possibility folded conformation numbers.
2. perfect information Forecasting Methodology according to claim 1, is characterized in that, predicts that the space conformation obtained is made up of the collapsed shape code that the frequency of occurrences is the highest.
3. perfect information Forecasting Methodology according to claim 1, it is characterized in that, predict that a space conformation obtaining is by the high collapsed shape code of the frequency of occurrences second, there is no the second high-frequency conformation position, with frequency the highest collapsed shape code formed as a supplement.
4. perfect information Forecasting Methodology according to claim 1, it is characterized in that, predict that a space conformation obtaining is by frequency of occurrences third high collapsed shape code, there is no third high frequency conformation position, with frequency the highest collapsed shape code formed as a supplement.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510623583.8A CN105260626B (en) | 2015-09-25 | 2015-09-25 | The full information Forecasting Methodology of protein structure space conformation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510623583.8A CN105260626B (en) | 2015-09-25 | 2015-09-25 | The full information Forecasting Methodology of protein structure space conformation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105260626A true CN105260626A (en) | 2016-01-20 |
CN105260626B CN105260626B (en) | 2017-11-14 |
Family
ID=55100315
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510623583.8A Active CN105260626B (en) | 2015-09-25 | 2015-09-25 | The full information Forecasting Methodology of protein structure space conformation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105260626B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107451421A (en) * | 2017-06-21 | 2017-12-08 | 南京迈格罗医药科技有限公司 | epitope conformation fingerprint database |
CN107463793A (en) * | 2017-06-21 | 2017-12-12 | 南京迈格罗医药科技有限公司 | Complementary antibody determines area's conformation fingerprint database |
CN108959852A (en) * | 2017-05-24 | 2018-12-07 | 北京工业大学 | Prediction technique on protein based on the pairs of Preference information of amino acid-nucleotide with RNA binding modules |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1998004734A1 (en) * | 1996-07-25 | 1998-02-05 | Wisconsin Alumni Research Foundation | Synthetic protein folding catalysis |
CN101082944A (en) * | 2007-06-01 | 2007-12-05 | 哈尔滨工程大学 | Computer simulation method for protein folding procedure based on synthesis algorithm |
CN101647022A (en) * | 2007-01-31 | 2010-02-10 | 桑迪亚医药技术(上海)有限责任公司 | Methods, systems, algorithyms and means for describing the possible conformations of actual and theoretical proteins and for evaluating actual and theoretical proteins with respect to folding, overall |
-
2015
- 2015-09-25 CN CN201510623583.8A patent/CN105260626B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1998004734A1 (en) * | 1996-07-25 | 1998-02-05 | Wisconsin Alumni Research Foundation | Synthetic protein folding catalysis |
CN101647022A (en) * | 2007-01-31 | 2010-02-10 | 桑迪亚医药技术(上海)有限责任公司 | Methods, systems, algorithyms and means for describing the possible conformations of actual and theoretical proteins and for evaluating actual and theoretical proteins with respect to folding, overall |
CN101082944A (en) * | 2007-06-01 | 2007-12-05 | 哈尔滨工程大学 | Computer simulation method for protein folding procedure based on synthesis algorithm |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108959852A (en) * | 2017-05-24 | 2018-12-07 | 北京工业大学 | Prediction technique on protein based on the pairs of Preference information of amino acid-nucleotide with RNA binding modules |
CN107451421A (en) * | 2017-06-21 | 2017-12-08 | 南京迈格罗医药科技有限公司 | epitope conformation fingerprint database |
CN107463793A (en) * | 2017-06-21 | 2017-12-12 | 南京迈格罗医药科技有限公司 | Complementary antibody determines area's conformation fingerprint database |
Also Published As
Publication number | Publication date |
---|---|
CN105260626B (en) | 2017-11-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Nanni et al. | Prediction of protein structure classes by incorporating different protein descriptors into general Chou’s pseudo amino acid composition | |
Park et al. | Factors affecting the ability of energy functions to discriminate correct from incorrect folds | |
Andrec et al. | Protein backbone structure determination using only residual dipolar couplings from one ordering medium | |
Zhao et al. | Antibody-specified B-cell epitope prediction in line with the principle of context-awareness | |
Berjanskii et al. | Unraveling the meaning of chemical shifts in protein NMR | |
JP2010517195A (en) | Methods, systems, algorithms and means for the description of possible structures of actual and theoretical proteins, and for the evaluation of actual and theoretical proteins with respect to folding, overall shape and structure motifs | |
Gattani et al. | StackCBPred: A stacking based prediction of protein-carbohydrate binding sites from sequence | |
CN105260626A (en) | Complete prediction method for protein structure spatial conformation | |
Andrec et al. | Complete protein structure determination using backbone residual dipolar couplings and sidechain rotamer prediction | |
Lo et al. | Conformational epitope matching and prediction based on protein surface spiral features | |
Runthala et al. | Refined template selection and combination algorithm significantly improves template-based modeling accuracy | |
CN107526939B (en) | Rapid alignment method for small molecular structure | |
CN113409897A (en) | Method, apparatus, device and storage medium for predicting drug-target interaction | |
CN111048145B (en) | Method, apparatus, device and storage medium for generating protein prediction model | |
US8024127B2 (en) | Local-global alignment for finding 3D similarities in protein structures | |
Tripathi et al. | TemPred: A novel protein template search engine to improve protein structure prediction | |
Wu et al. | OPUS-Dom: applying the folding-based method VECFOLD to determine protein domain boundaries | |
Sekmen et al. | Subspace modeling for classification of protein secondary structure elements from Cα trace | |
Lee et al. | Ranked k-spectrum kernel for comparative and evolutionary comparison of exons, introns, and cpg islands | |
Planas-Iglesias et al. | AggreProt: a web server for predicting and engineering aggregation prone regions in proteins | |
US20170323050A1 (en) | Structural analysis of proteins by structural representation and comparison of proteins | |
CN107451421A (en) | epitope conformation fingerprint database | |
Joo et al. | Prediction of the secondary structures of proteins by using PREDICT, a nearest neighbor method on pattern space | |
Lo et al. | Conformational epitope matching and prediction based on protein surface spiral | |
Pollastri et al. | Distill: a machine learning approach to ab initio protein structure prediction |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20231110 Address after: Room E and F, 26th Floor, No. 828-838 Zhangyang Road, China (Shanghai) Pilot Free Trade Zone, Pudong New Area, Shanghai, 201208 Patentee after: McCullough Biotechnology (Shanghai) Co.,Ltd. Address before: Building B5, No. 666 Gaoxin Avenue, Donghu Development Zone, Wuhan City, Hubei Province, 430075 Patentee before: Mccollow Pharmaceutical Technology (Wuhan) Co.,Ltd. |
|
TR01 | Transfer of patent right |