CN105808973A - Staged multi-strategy-based group conformation space sampling method - Google Patents

Staged multi-strategy-based group conformation space sampling method Download PDF

Info

Publication number
CN105808973A
CN105808973A CN201610121504.8A CN201610121504A CN105808973A CN 105808973 A CN105808973 A CN 105808973A CN 201610121504 A CN201610121504 A CN 201610121504A CN 105808973 A CN105808973 A CN 105808973A
Authority
CN
China
Prior art keywords
conformation
psi
target
dihedral angle
angle phi
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610121504.8A
Other languages
Chinese (zh)
Other versions
CN105808973B (en
Inventor
张贵军
周晓根
俞旭锋
郝小虎
王柳静
徐东伟
李章维
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN201610121504.8A priority Critical patent/CN105808973B/en
Publication of CN105808973A publication Critical patent/CN105808973A/en
Application granted granted Critical
Publication of CN105808973B publication Critical patent/CN105808973B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations

Landscapes

  • Bioinformatics & Cheminformatics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Analytical Chemistry (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The invention discloses a staged multi-strategy-based group conformation space sampling method. The method comprises the following steps: dividing a whole algorithm process into a plurality of stages under the framework of a differential evolution algorithm, setting a group of strategy pools for each stage, and when the algorithm achieves a certain stage, randomly selecting a strategy from the corresponding strategy pool to generate a new test conformation individual on the basis of a fragment assembling technology, so as to improve the conformation sampling ability and the convergence rate of the algorithm; introducing distance spectrum constraint in the conformation selection link, when the energy of the tested conformation is higher than a target conformation, comparing the distance differences of the tested conformation and of the target conformation, and if the distance difference of the test individual is relatively small, receiving the tested conformation at a certain probability, so as to guide the algorithm sampling to obtain a conformation with lower energy and a more reasonable structure and then improve the prediction precision of the algorithm.

Description

A kind of based on the interim shifty colony conformational space method of sampling
Technical field
The present invention relates to bioinformatics, computer application field, in particular a kind of based on the interim shifty colony conformational space method of sampling.
Background technology
Having numerous protein (long-chain formed by 2 several amino acids) in biological cell, these macromole play an important role in organism, most important for completing biological function.Protein molecule has reacted the important relationship between its structure and function on a molecular scale, and different protein plays a different role in organism.The space structure of protein often determines its function, therefore, protein structure prediction to the design of new albumen, drug design, protein stability prediction and protein between interaction modeling most important.
The structure of protein is generally divided into four levels: primary structure (aminoacid sequence), secondary structure (partial structurtes formed that interact between skeletal atom), tertiary structure (secondary structure piles up the space structure formed on a large scale) and quarternary structure (describing the interaction between different subunit).Especially, the three dimensional structure (native state structure) of protein is the key of the biological function understanding protein.Protein three-dimensional structure can be obtained by experimental techniques such as nuclear magnetic resonance, NMR and X-ray crystal diffractions, but these experimental determining methods are not only consuming time but also extremely expensive, for inapplicable the protein of some not easily crystallization.Therefore, according to the thermodynamics hypothesis (conformation with minimum energy is considered as native state structure) of Anfinsen, a lot of computational algorithms are proposed for protein structure prediction.
Carrying out protein tertiary structure by computing technique and be usually directed to an energy function evaluating very expensive, its energy function curved surface is generally of thousands of degree of freedom and substantial amounts of locally optimal solution.So huge higher-dimension conformational space carries out sampling extremely difficult.In order to carry out conformational space search, first ab initio prediction method generally obtains the Global optimal solution of conformational space according to Knowledge based engineering coarseness energy model, then the conformation of its correspondence is carried out refine, thus obtaining pre-geodesic structure.Therefore, ab initio prediction method need solve two problems: 1. set up suitable energy function to weigh in protein molecule not homoatomic between interaction;2. propose effective conformational space searching method and search for energy Global optimal solution.
Differential evolution algorithm (DE) has proved to be Stochastic Global Optimization Algorithms simple and the most powerful in evolution algorithm.Due to its simple in construction, it is easy to accomplish, the advantage such as strong robustness and fast convergence rate has been successfully applied to protein tertiary structure.But, along with the growth of aminoacid sequence, protein molecule system degree of freedom also increases, and utilizes the Global optimal solution that the algorithm sampling of tradition colony obtains large-scale protein matter conformational space to become a challenging job;Tradition Swarm Evolution algorithm, when conformational space is sampled, can quickly navigate to the region at minimal solution place, but the later stage is more weak due to local enhancement ability in early stage, and convergence rate is relatively slow, and is easily ensnared into local optimum, it is impossible to obtain Global optimal solution.
Therefore, existing colony conformational space adopts method Shortcomings in ability in sampling and convergence rate, it is necessary to improve.
Summary of the invention
In order to overcome the deficiency in ability in sampling and convergence rate of the existing colony conformational space method of sampling, the present invention provide a kind of promote ability in sampling, improve convergence rate, improve precision of prediction based on the interim shifty colony conformational space method of sampling.
The technical solution adopted for the present invention to solve the technical problems is:
A kind of based on the interim shifty colony conformational space method of sampling, the described method of sampling comprises the following steps:
1) given list entries information;
2) from QUARK server (http://zhanglab.ccmb.med.umich.edu/QUARK/), distance spectrum file profile, rp are obtained according to sequence informationiFor the residue pair recorded in distance spectrum, DiFor this residue between distance, wherein i ∈ (1, N), N be in distance spectrum residue to quantity;
3) parameter is set: Population Size NP, the iterations G of algorithm, intersection factor CR, stage factor s, put iteration algebraically g=0;
4) initialization of population: produced NP initial configurations C by list entriesi, i={1 ..., NP}, the individual all position fragments of each conformation are assembled;
5) for each conformation individuality C in populationi, i ∈ 1,2,3 ..., and NP}, make Ctarget=Ci, CtargetRepresent that target conformation is individual, perform following operation and obtain variation conformation Cmutant:
5.1) stochastic generation positive integer rand1, rand2, rand3 ∈ 1,2,3 ... NP}, and rand1 ≠ rand2 ≠ rand3 ≠ i;Regeneration 4 random integers randrange1, randrange2, randrange3, randrange4;Wherein randrange1 ≠ randrange2, randrange3 ≠ randrange4 ∈ 1,2 ..., L}, L is sequence length;
5.2) a=min (randrange1, randrange2), b=max (randrange1, randrange2), k ∈ [a, b] are made;Make c=min (randrange3, randrange4), d=max (randrange3, randrange4), p ∈ [c, d];Wherein min represents the minima taking two numbers, and max represents the maximum taking two numbers
5.3) if g is < s G, then following operation is performed:
5.3.1) if randn (1,3)=1, then C is usedrand2Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position, then by gained Crand1Carry out fragment assembling and obtain variation conformation Cmutant, wherein randn (1,3) represents the integer between stochastic generation [1,3];
5.3.2) if randn (1,3)=2, then C is usedrand2Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand3Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.3.3) if randn (1,3)=3, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
5.4) if s is G≤g < 2s G, then following operation is performed:
5.4.1) according to energy, the conformation in whole population is carried out ascending order arrangement, then 0.5NP conformation individuality of the past selects a conformation at random and be designated as Cpbest
5.4.2) if randn (1,3)=1, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CpbestDihedral angle phi, psi, omega corresponding to same position, then by gained CpbestCarry out fragment assembling and obtain variation conformation Cmutant
5.4.3) if randn (1,3)=2, then C is usedpbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.4.4) if randn (1,3)=3, then C is usedpbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand1Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
5.5) if g >=2s is G, then following operation is performed:
5.5.1) whole population stochastic averagina is divided into three groups, it is judged that current goal conformation CtargetThe group at place, then selects the conformation C of minimum energy from corresponding grouplbest
5.5.2) if randn (1,3)=1, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CpbestDihedral angle phi, psi, omega corresponding to same position, then by gained CpbestCarry out fragment assembling and obtain variation conformation Cmutant
5.5.3) if randn (1,3)=2, then C is usedlbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.5.4) if randn (1,3)=3, then C is usedlbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand1Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
6) to variation conformation CmutantPerform to intersect and operate:
6.1) random number rand4, rand5, wherein rand4 ∈ (0,1), rand5 ∈ (1, L) are generated;
6.2) basis C t r a i l = C m u tan t , r a n d 5 &LeftArrow; C t arg e t , r a n d 5 , i f ( r a n d 4 &le; C R ) C m u tan t , r a n d 5 , o t h e r w i s e Perform crossover process: if random number rand4≤CR, make a variation conformation CmutantFragment rand5 replace with target conformation CtargetThe fragment of middle correspondence, is otherwise directly equal to variation conformation Cmutant
7) to target conformation CtargetWith test conformation CtrailCarry out selecting operation;
7.1) C is calculatedtargetAnd CtrailEnergy: E (Ctarget) and E (Ctrail);
7.2) if E is (Ctarget)>E(Ctrail), then CtrailReplace Ctarget, and forward step 8 to), otherwise continue executing with step 7.3);
7.3) target conformation C is calculated respectivelytargetWith test conformation CtrailMiddle residue is to rpiBetween the distance distance corresponding with distance spectrum between range differenceWith
7.4) judgeWithValue whether more than 6, if more than 6, then make it be equal to 6, remove the abnormal numerical value that some distance difference are bigger;
7.5) calculate respectively all residues in target conformation and test conformation between distance and distance spectrum in the meansigma methods of difference of distance, &dtri; D t arg e t = 1 N &Sigma; i = 1 N &dtri; D i t arg e t , &dtri; D t r a i l = 1 N &Sigma; i = 1 N &dtri; D i t r a i l ;
7.6) if Dtrail> Dtarget, then step 8 is forwarded to);
7.7) if Dtrail< Dtarget, then produce the random number rand6 between (0,1), if rand6 is less than 0.2, then use CtrailReplace Ctarget, otherwise enter step 8);
8) i=i+1;Judge that whether i is be more than or equal to NP, if it is g=g+1, otherwise enter step 9);
9) operating procedure 5 of iteration)~7), to meeting end condition.
The technology of the present invention is contemplated that: under differential evolution algorithm framework, whole algorithmic procedure is divided into multiple stage, each stage is arranged a group policy pond, when algorithm reaches certain stage, from the tactful pond of its correspondence, randomly select a strategy, based on fragment package technique, generate new test conformation individual, thus improving conformation ability in sampling and convergence of algorithm speed;Select link to introduce distance spectrum constraint in conformation simultaneously, when the energy testing conformation is higher than target conformation, then compare both range differences, if the range difference of test individuality is less, then with certain probability acceptance test conformation, thus bootstrap algorithm sampling obtains, energy is lower and the conformation of more reasonable structure, improves the precision of prediction of algorithm.
The present invention is under basic differential evolution algorithm framework, based on fragment package technique, in algorithm each stage, different new conformation generation strategy ponds is set, then from strategy pond, randomly select one and generate new conformation, improving convergence of algorithm speed and reliability, selecting link to add distance restraint simultaneously, bootstrap algorithm is sampled in the region that energy is low and rational in infrastructure, improve the ability in sampling of algorithm on the whole, thus improving precision of prediction.
The invention have the benefit that based on fragment package technique, adopt different conformation generation strategies in each stage of algorithm, improve convergence of algorithm speed and conformation ability in sampling;Distance spectrum introduces as auxiliary constraint and selects link, and bootstrap algorithm sampling obtains high-quality conformation.
Accompanying drawing explanation
Fig. 1 is based on conformation when protein 1AIL is sampled by the interim shifty colony conformational space method of sampling and updates schematic diagram.
The conformation scattergram that Fig. 2 obtains when being based on interim shifty colony conformational space method of sampling protein 1AIL sampling;
Detailed description of the invention
Below in conjunction with accompanying drawing, the invention will be further described.
With reference to Fig. 1~2, a kind of based on the interim shifty colony conformational space method of sampling, comprise the following steps:
1) given list entries information;
2) from QUARK server (http://zhanglab.ccmb.med.umich.edu/QUARK/), distance spectrum file profile, rp are obtained according to sequence informationiFor the residue pair recorded in distance spectrum, DiFor this residue between distance, wherein i ∈ (1, N), N be in distance spectrum residue to quantity;
3) parameter is set: Population Size NP, the iterations G of algorithm, intersection factor CR, stage factor s, put iteration algebraically g=0;
4) initialization of population: produced NP initial configurations C by list entriesi, i={1 ..., NP}, the individual all position fragments of each conformation are assembled;
5) for each conformation individuality C in populationi, i ∈ 1,2,3 ..., and NP}, make Ctarget=Ci, CtargetRepresent that target conformation is individual, perform following operation and obtain variation conformation Cmutant:
5.1) stochastic generation positive integer rand1, rand2, rand3 ∈ 1,2,3 ... NP}, and rand1 ≠ rand2 ≠ rand3 ≠ i;Regeneration 4 random integers randrange1, randrange2, randrange3, randrange4;Wherein randrange1 ≠ randrange2, randrange3 ≠ randrange4 ∈ 1,2 ..., L}, L is sequence length;
5.2) a=min (randrange1, randrange2), b=max (randrange1, randrange2), k ∈ [a, b] are made;Make c=min (randrange3, randrange4), d=max (randrange3, randrange4), p ∈ [c, d];Wherein min represents the minima taking two numbers, and max represents the maximum taking two numbers
5.3) if g is < s G, then following operation is performed:
5.3.1) if randn (1,3)=1, then C is usedrand2Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position, then by gained Crand1Carry out fragment assembling and obtain variation conformation Cmutant, wherein randn (1,3) represents the integer between stochastic generation [1,3];
5.3.2) if randn (1,3)=2, then C is usedrand2Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand3Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.3.3) if randn (1,3)=3, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
5.4) if s is G≤g < 2s G, then following operation is performed:
5.4.1) according to energy, the conformation in whole population is carried out ascending order arrangement, then 0.5NP conformation individuality of the past selects a conformation at random and be designated as Cpbest
5.4.2) if randn (1,3)=1, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CpbestDihedral angle phi, psi, omega corresponding to same position, then by gained CpbestCarry out fragment assembling and obtain variation conformation Cmutant
5.4.3) if randn (1,3)=2, then C is usedpbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.4.4) if randn (1,3)=3, then C is usedpbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand1Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
5.5) if g >=2s is G, then following operation is performed:
5.5.1) whole population stochastic averagina is divided into three groups, it is judged that current goal conformation CtargetThe group at place, then selects the conformation C of minimum energy from corresponding grouplbest
5.5.2) if randn (1,3)=1, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CpbestDihedral angle phi, psi, omega corresponding to same position, then by gained CpbestCarry out fragment assembling and obtain variation conformation Cmutant
5.5.3) if randn (1,3)=2, then C is usedlbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.5.4) if randn (1,3)=3, then C is usedlbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand1Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
6) to variation conformation CmutantPerform to intersect and operate:
6.1) random number rand4, rand5, wherein rand4 ∈ (0,1), rand5 ∈ (1, L) are generated;
6.2) basis C t r a i l = C m u tan t , r a n d 5 &LeftArrow; C t arg e t , r a n d 5 , i f ( r a n d 4 &le; C R ) C m u tan t , r a n d 5 , o t h e r w i s e Perform crossover process: if random number rand4≤CR, make a variation conformation CmutantFragment rand5 replace with target conformation CtargetThe fragment of middle correspondence, is otherwise directly equal to variation conformation Cmutant
7) to target conformation CtargetWith test conformation CtrailCarry out selecting operation;
7.1) C is calculatedtargetAnd CtrailEnergy: E (Ctarget) and E (Ctrail);
7.2) if E is (Ctarget)>E(Ctrail), then CtrailReplace Ctarget, and forward step 8 to), otherwise continue executing with step 7.3);
7.3) target conformation C is calculated respectivelytargetWith test conformation CtrailMiddle residue is to the range difference between the distance corresponding with distance spectrum of the distance between rpiWith
7.4) judgeWithValue whether more than 6, if more than 6, then make it be equal to 6, remove the abnormal numerical value that some distance difference are bigger;
7.5) calculate respectively all residues in target conformation and test conformation between distance and distance spectrum in the meansigma methods of difference of distance, &dtri; D t arg e t = 1 N &Sigma; i = 1 N &dtri; D i t arg e t , &dtri; D t r a i l = 1 N &Sigma; i = 1 N &dtri; D i t r a i l ;
7.6) if Dtrail> Dtarget, then step 8 is forwarded to);
7.7) if Dtrail< Dtarget, then produce the random number rand6 between (0,1), if rand6 is less than 0.2, then use CtrailReplace Ctarget, otherwise enter step 8);
8) i=i+1;Judge that whether i is be more than or equal to NP, if it is g=g+1, otherwise enter step 9);
9) operating procedure 5 of iteration)~7), to meeting end condition.
The present embodiment with sequence length be 73 protein 1AIL for embodiment, a kind of based on the interim shifty colony conformational space method of sampling, wherein comprise the steps of
1) sequence information of given protein 1AIL;
2) from QUARK server (http://zhanglab.ccmb.med.umich.edu/QUARK/), distance spectrum file profile, rp are obtained according to sequence informationiFor the residue pair recorded in distance spectrum, DiFor this residue between distance, wherein i ∈ (1, N), N=75 be in distance spectrum residue to quantity;
3) parameter is set: Population Size NP=30, the iterations G=10000 of algorithm, intersection factor CR=0.5, stage factor s=1/3, put iteration algebraically g=0;
4) initialization of population: produced NP initial configurations C by list entriesi, i={1 ..., NP}, the individual all position fragments of each conformation are assembled;
5) for each conformation individuality C in populationi, i ∈ 1,2,3 ..., and NP}, make Ctarget=Ci, CtargetRepresent that target conformation is individual, perform following operation and obtain variation conformation Cmutant:
5.1) stochastic generation positive integer rand1, rand2, rand3 ∈ 1,2,3 ... NP}, and rand1 ≠ rand2 ≠ rand3 ≠ i;Regeneration 4 random integers randrange1, randrange2, randrange3, randrange4;Wherein randrange1 ≠ randrange2, randrange3 ≠ randrange4 ∈ 1,2 ..., L}, L is sequence length;
5.2) a=min (randrange1, randrange2), b=max (randrange1, randrange2), k ∈ [a, b] are made;Make c=min (randrange3, randrange4), d=max (randrange3, randrange4), p ∈ [c, d];Wherein min represents the minima taking two numbers, and max represents the maximum taking two numbers
5.3) if g is < s G, then following operation is performed:
5.3.1) if randn (1,3)=1, then C is usedrand2Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position, then by gained Crand1Carry out fragment assembling and obtain variation conformation Cmutant, wherein randn (1,3) represents the integer between stochastic generation [1,3];
5.3.2) if randn (1,3)=2, then C is usedrand2Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand3Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.3.3) if randn (1,3)=3, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
5.4) if s is G≤g < 2s G, then following operation is performed:
5.4.1) according to energy, the conformation in whole population is carried out ascending order arrangement, then 0.5NP conformation individuality of the past selects a conformation at random and be designated as Cpbest
5.4.2) if randn (1,3)=1, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CpbestDihedral angle phi, psi, omega corresponding to same position, then by gained CpbestCarry out fragment assembling and obtain variation conformation Cmutant
5.4.3) if randn (1,3)=2, then C is usedpbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.4.4) if randn (1,3)=3, then C is usedpbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand1Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
5.5) if g >=2s is G, then following operation is performed:
5.5.1) whole population stochastic averagina is divided into three groups, it is judged that current goal conformation CtargetThe group at place, then selects the conformation C of minimum energy from corresponding grouplbest
5.5.2) if randn (1,3)=1, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CpbestDihedral angle phi, psi, omega corresponding to same position, then by gained CpbestCarry out fragment assembling and obtain variation conformation Cmutant
5.5.3) if randn (1,3)=2, then C is usedlbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Upper position c replaces C to dihedral angle phi, psi, omega corresponding for aminoacid institute p of the fragment of position drand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.5.4) if randn (1,3)=3, then C is usedlbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand1Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
6) to variation conformation CmutantPerform to intersect and operate:
6.1) random number rand4, rand5, wherein rand4 ∈ (0,1), rand5 ∈ (1, L) are generated;
6.2) basis C t r a i l = C m u tan t , r a n d 5 &LeftArrow; C t arg e t , r a n d 5 , i f ( r a n d 4 &le; C R ) C m u tan t , r a n d 5 , o t h e r w i s e Perform crossover process: if random number rand4≤CR, make a variation conformation CmutantFragment rand5 replace with target conformation CtargetThe fragment of middle correspondence, is otherwise directly equal to variation conformation Cmutant
7) to target conformation CtargetWith test conformation CtrailCarry out selecting operation;
7.1) C is calculatedtargetAnd CtrailEnergy: E (Ctarget) and E (Ctrail);
7.2) if E is (Ctarget)>E(Ctrail), then CtrailReplace Ctarget, and forward step 8 to), otherwise continue executing with step 7.3);
7.3) target conformation C is calculated respectivelytargetWith test conformation CtrailMiddle residue is to the range difference between the distance corresponding with distance spectrum of the distance between rpiWith
7.4) judgeWithValue whether more than 6, if more than 6, then make it be equal to 6, remove the abnormal numerical value that some distance difference are bigger;
7.5) calculate respectively all residues in target conformation and test conformation between distance and distance spectrum in the meansigma methods of difference of distance, &dtri; D t arg e t = 1 N &Sigma; i = 1 N &dtri; D i t arg e t , &dtri; D t r a i l = 1 N &Sigma; i = 1 N &dtri; D i t r a i l ;
7.6) if Dtrail> Dtarget, then step 8 is forwarded to);
7.7) if Dtrail< Dtarget, then produce the random number rand6 between (0,1), if rand6 is less than 0.2, then use CtrailReplace Ctarget, otherwise enter step 8);
8) i=i+1;Judge that whether i is be more than or equal to NP, if it is g=g+1, otherwise enter step 9);
9) operating procedure 5 of iteration)~7), to iterations G=10000 time.
With sequence length be 73 protein 1AIL for embodiment, above method is used to obtain the nearly native state conformation of this protein, the average root-mean-square deviation run between 30 obtained structures and native state structure is 3.75, lowest mean square root deviation is 2.83, conformation renewal figure in conformation assemblage as it is shown in figure 1, the conformation scattergram that obtains of sampling as shown in Figure 2.
The excellent results that the embodiment that the present invention provides that described above is shows, the obvious present invention is not only suitable for above-described embodiment, it can be done many variations and be carried out under not necessarily departing from essence spirit of the present invention and the premise without departing from content involved by flesh and blood of the present invention.

Claims (1)

1. one kind based on the interim shifty colony conformational space method of sampling, it is characterised in that: the described conformational space method of sampling comprises the following steps:
1) given list entries information;
2) from QUARK server, distance spectrum file profile, rp are obtained according to sequence informationiFor the residue pair recorded in distance spectrum, DiFor this residue between distance, wherein i ∈ (1, N), N be in distance spectrum residue to quantity;
3) parameter is set: Population Size NP, the iterations G of algorithm, intersection factor CR, stage factor s, put iteration algebraically g=0;
4) initialization of population: produced NP initial configurations C by list entriesi, i={1 ..., NP}, the individual all position fragments of each conformation are assembled;
5) for each conformation individuality C in populationi, i ∈ 1,2,3 ..., and NP}, make Ctarget=Ci, CtargetRepresent that target conformation is individual, perform following operation and obtain variation conformation Cmutant:
5.1) stochastic generation positive integer rand1, rand2, rand3 ∈ 1,2,3 ... NP}, and rand1 ≠ rand2 ≠ rand3 ≠ i;Regeneration 4 random integers randrange1, randrange2, randrange3, randrange4;Wherein randrange1 ≠ randrange2, randrange3 ≠ randrange4 ∈ 1,2 ..., L}, L is sequence length;
5.2) a=min (randrange1, randrange2), b=max (randrange1, randrange2), k ∈ [a, b] are made;Make c=min (randrange3, randrange4), d=max (randrange3, randrange4), p ∈ [c, d];Wherein min represents the minima taking two numbers, and max represents the maximum taking two numbers
5.3) if g is < s G, then following operation is performed:
5.3.1) if randn (1,3)=1, then C is usedrand2Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position, then by gained Crand1Carry out fragment assembling and obtain variation conformation Cmutant, wherein randn (1,3) represents the integer between stochastic generation [1,3];
5.3.2) if randn (1,3)=2, then C is usedrand2Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand3Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.3.3) if randn (1,3)=3, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
5.4) if s is G≤g < 2s G, then following operation is performed:
5.4.1) according to energy, the conformation in whole population is carried out ascending order arrangement, then 0.5NP conformation individuality of the past selects a conformation at random and be designated as Cpbest
5.4.2) if randn (1,3)=1, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CpbestDihedral angle phi, psi, omega corresponding to same position, then by gained CpbestCarry out fragment assembling and obtain variation conformation Cmutant
5.4.3) if randn (1,3)=2, then C is usedpbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.4.4) if randn (1,3)=3, then C is usedpbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand1Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
5.5) if g >=2s is G, then following operation is performed:
5.5.1) whole population stochastic averagina is divided into three groups, it is judged that current goal conformation CtargetThe group at place, then selects the conformation C of minimum energy from corresponding grouplbest
5.5.2) if randn (1,3)=1, then C is usedrand1Dihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CpbestDihedral angle phi, psi, omega corresponding to same position, then by gained CpbestCarry out fragment assembling and obtain variation conformation Cmutant
5.5.3) if randn (1,3)=2, then C is usedlbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces Crand1Dihedral angle phi, psi, omega corresponding to same position;Re-use Crand2Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces Crand1Dihedral angle phi, psi, omega corresponding to upper same position, then by gained Crand1Carry out fragment assembling to obtain testing individual Cmutant
5.5.4) if randn (1,3)=3, then C is usedlbestDihedral angle phi, psi, omega corresponding to upper position a to the aminoacid k of the fragment of position b replaces CtargetDihedral angle phi, psi, omega corresponding to same position;Re-use Crand1Dihedral angle phi, psi, omega corresponding to upper position c to the aminoacid p of the fragment of position d replaces CtargetDihedral angle phi, psi, omega corresponding to upper same position, then by gained CtargetCarry out fragment assembling to obtain testing individual Cmutant
6) to variation conformation CmutantPerform to intersect and operate:
6.1) random number rand4, rand5, wherein rand4 ∈ (0,1), rand5 ∈ (1, L) are generated;
6.2) basis C t r a i l = C m u tan t , r a n d 5 &LeftArrow; C t arg e t , r a n d 5 , i f ( r a n d 4 &le; C R ) C m u tan t , r a n d 5 , o t h e r w i s e Perform crossover process: if random number rand4≤CR, make a variation conformation CmutantFragment rand5 replace with target conformation CtargetThe fragment of middle correspondence, is otherwise directly equal to variation conformation Cmutant
7) to target conformation CtargetWith test conformation CtrailCarry out selecting operation;
7.1) C is calculatedtargetAnd CtrailEnergy: E (Ctarget) and E (Ctrail);
7.2) if E is (Ctarget)>E(Ctrail), then CtrailReplace Ctarget, and forward step 8 to), otherwise continue executing with step 7.3);
7.3) target conformation C is calculated respectivelytargetWith test conformation CtrailMiddle residue is to rpiBetween the distance distance corresponding with distance spectrum between range differenceWith
7.4) judgeWithValue whether more than 6, if more than 6, then make it be equal to 6, remove the abnormal numerical value that some distance difference are bigger;
7.5) calculate respectively all residues in target conformation and test conformation between distance and distance spectrum in the meansigma methods of difference of distance, &dtri; D t arg e t = 1 N &Sigma; i = 1 N &dtri; D i t arg e t , &dtri; D t r a i l = 1 N &Sigma; i = 1 N &dtri; D i t r i a l ;
7.6) if Dtrail> Dtarget, then step 8 is forwarded to);
7.7) if Dtrail< Dtarget, then produce the random number rand6 between (0,1), if rand6 is less than 0.2, then use CtrailReplace Ctarget, otherwise enter step 8);
8) i=i+1;Judge that whether i is be more than or equal to NP, if it is g=g+1, otherwise enter step 9);
9) operating procedure 5 of iteration)~7), to meeting end condition.
CN201610121504.8A 2016-03-03 2016-03-03 One kind is based on interim shifty group's conformational space method of sampling Active CN105808973B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610121504.8A CN105808973B (en) 2016-03-03 2016-03-03 One kind is based on interim shifty group's conformational space method of sampling

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610121504.8A CN105808973B (en) 2016-03-03 2016-03-03 One kind is based on interim shifty group's conformational space method of sampling

Publications (2)

Publication Number Publication Date
CN105808973A true CN105808973A (en) 2016-07-27
CN105808973B CN105808973B (en) 2018-06-01

Family

ID=56466626

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610121504.8A Active CN105808973B (en) 2016-03-03 2016-03-03 One kind is based on interim shifty group's conformational space method of sampling

Country Status (1)

Country Link
CN (1) CN105808973B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107633159A (en) * 2017-08-21 2018-01-26 浙江工业大学 A kind of protein conformation space search method based on Distance conformability degree
CN108416482A (en) * 2018-04-03 2018-08-17 浙江工业大学 One kind is based on regional shifty logistics distribution paths planning method
CN109378035A (en) * 2018-08-29 2019-02-22 浙江工业大学 A kind of Advances in protein structure prediction based on secondary structure dynamic select strategy
CN109461471A (en) * 2018-08-30 2019-03-12 浙江工业大学 A kind of adaptive Advances in protein structure prediction based on championship mechanism
CN109509510A (en) * 2018-07-12 2019-03-22 浙江工业大学 A kind of Advances in protein structure prediction based on assemblage Mutation Strategy on multiple populations
CN109584954B (en) * 2018-11-05 2021-04-06 浙江工业大学 Protein conformation space optimization method based on multi-population joint search

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104951670A (en) * 2015-06-08 2015-09-30 浙江工业大学 Group conformation space optimization method based on distance spectrum

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104951670A (en) * 2015-06-08 2015-09-30 浙江工业大学 Group conformation space optimization method based on distance spectrum

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
DONG XU ET AL: "Ab Initio Protein Structure Assembly Using Continuous Structure Fragments and Optimized Knowledge-based Force Field", 《PROTEINS-STRUCTURE FUNCTION & BIOINFORMATICS》 *
GUANGZHENG ZHANG ET AL: "Combing genetic algorithm with neural network technique for protein inter-residue spatial distance prediction", 《NEURAL NETWORKS, 2004. PROCEEDINGS》 *
PETER M. BOWERS ET AL: "De novo protein structure determination using sparse NMR data", 《JOURNAL OF BIOMOLECULAR NMR》 *
PREDRAG KUKIC ET AL: "Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks", 《BMC BIOINFORMATICS》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107633159A (en) * 2017-08-21 2018-01-26 浙江工业大学 A kind of protein conformation space search method based on Distance conformability degree
CN107633159B (en) * 2017-08-21 2020-06-02 浙江工业大学 Protein conformation space search method based on distance similarity
CN108416482A (en) * 2018-04-03 2018-08-17 浙江工业大学 One kind is based on regional shifty logistics distribution paths planning method
CN108416482B (en) * 2018-04-03 2021-08-03 浙江工业大学 Logistics distribution path planning method based on regional multiple strategies
CN109509510A (en) * 2018-07-12 2019-03-22 浙江工业大学 A kind of Advances in protein structure prediction based on assemblage Mutation Strategy on multiple populations
CN109509510B (en) * 2018-07-12 2021-06-18 浙江工业大学 Protein structure prediction method based on multi-population ensemble variation strategy
CN109378035A (en) * 2018-08-29 2019-02-22 浙江工业大学 A kind of Advances in protein structure prediction based on secondary structure dynamic select strategy
CN109378035B (en) * 2018-08-29 2021-02-26 浙江工业大学 Protein structure prediction method based on secondary structure dynamic selection strategy
CN109461471A (en) * 2018-08-30 2019-03-12 浙江工业大学 A kind of adaptive Advances in protein structure prediction based on championship mechanism
CN109461471B (en) * 2018-08-30 2020-12-01 浙江工业大学 Adaptive protein structure prediction method based on championship mechanism
CN109584954B (en) * 2018-11-05 2021-04-06 浙江工业大学 Protein conformation space optimization method based on multi-population joint search

Also Published As

Publication number Publication date
CN105808973B (en) 2018-06-01

Similar Documents

Publication Publication Date Title
CN105808973A (en) Staged multi-strategy-based group conformation space sampling method
Terwilliger et al. Iterative model building, structure refinement and density modification with the PHENIX AutoBuild wizard
CN106778059B (en) A kind of group&#39;s Advances in protein structure prediction based on Rosetta local enhancement
CN105205348A (en) Method for colony conformation space optimization based on distance constraint selection strategy
CN108846256B (en) Group protein structure prediction method based on residue contact information
CN103473482B (en) Based on the prediction method for three-dimensional structure of protein that differential evolution and conformational space are annealed
CN106055920B (en) It is a kind of based on the Advances in protein structure prediction that tactful copy exchanges more than stage
CN109086566B (en) Group protein structure prediction method based on fragment resampling
CN106096326B (en) A kind of differential evolution Advances in protein structure prediction based on barycenter Mutation Strategy
CN109524058A (en) A kind of protein dimer Structure Prediction Methods based on differential evolution
CN109509510B (en) Protein structure prediction method based on multi-population ensemble variation strategy
CN109346128B (en) Protein structure prediction method based on residue information dynamic selection strategy
CN105912885B (en) A kind of Advances in protein structure prediction based on colony&#39;s Lipschitz Lower Bound Estimations
CN108595910A (en) A kind of group&#39;s protein conformation space optimization method based on diversity index
CN113539364A (en) Method for predicting protein phosphorylation by deep neural network framework
CN109448785B (en) Protein structure prediction method for enhancing Loop region structure by using Laplace graph
CN109461471B (en) Adaptive protein structure prediction method based on championship mechanism
CN109448786B (en) Method for predicting protein structure by lower bound estimation dynamic strategy
CN109243526B (en) Protein structure prediction method based on specific fragment crossing
CN109360600B (en) Protein structure prediction method based on residue characteristic distance
CN109326318B (en) Group protein structure prediction method based on Loop region Gaussian disturbance
CN109461472B (en) Protein conformation space optimization method based on replica exchange and biased distribution estimation
CN110706741B (en) Multi-modal protein structure prediction method based on sequence niche
Zhang et al. Distance-AF: Modifying Predicted Protein Structure Models by Alphafold2 with User-Specified Distance Constraints
CN108629151B (en) Group protein structure prediction method based on Laplace graph

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant