CN114121146B

CN114121146B - RNA tertiary structure prediction method based on parallel and Monte Carlo strategies

Info

Publication number: CN114121146B
Application number: CN202111428461.5A
Authority: CN
Inventors: 刘振栋; 杨玉荣; 李冬雁; 陈曦; 吕欣荣; 秦梦颖; 柏苛; 何志强; 李晓峰; 王少华; 胡国胜
Original assignee: Shandong Jianzhu University
Current assignee: Shandong Jianzhu University
Priority date: 2021-11-29
Filing date: 2021-11-29
Publication date: 2023-10-03
Anticipated expiration: 2041-11-29
Also published as: CN114121146A

Abstract

The invention discloses an RNA tertiary structure prediction method based on parallel and Monte Carlo strategies, and belongs to the field of structure prediction. The method comprises performing conformational space sampling by using a parallel mechanism; scoring according to the latest updated energy function; performing rationality judgment on Monte Carlo operation of which the conformation is based on 'Stepwise ansatz' through two rounds of potential energy judgment; and finally, judging the structural integrity and modeling accuracy, and processing the result until a stable RNA tertiary structure with high accuracy and high integrity is obtained. The RNA tertiary structure prediction method provided by the invention can obtain the RNA tertiary structure with high precision and high integrity. The RNA three-level structure prediction method based on the parallel and Monte Carlo strategies has high flexibility, the Monte Carlo times can be specified, and the modeling precision and the modeling time cost can be measured by a user; the method solves the problem that the modeling of the RNA motif is incomplete in the prior art; the invention increases the breadth and depth of the conformational sampling, reduces the influence of the pseudo minimum free energy and improves the modeling precision.

Description

RNA tertiary structure prediction method based on parallel and Monte Carlo strategies

Technical Field

The invention belongs to the field of structure prediction, and particularly relates to an RNA tertiary structure prediction method based on parallel and Monte Carlo strategies.

Background

New studies have found that RNA has some complex biological functions. The structure determines the function, so that it is necessary to know the structure of RNA in advance in order to explore the function of RNA. At present, two methods for determining RNA tertiary structure at home and abroad are mainly available. The first method is to use experimental measurement methods such as x-ray, nuclear magnetic resonance and a frozen electron microscope, and the result obtained by the experimental method is relatively accurate and reliable, but the number of conformations increases exponentially with the increase of the length of the RNA, so that the cost is high. The second method is a structure prediction method based on biological calculation, and the current RNA tertiary structure prediction algorithm is mainly a knowledge mining-based prediction method and a physical prediction method. Knowledge mining-based three-level structure prediction methods rely on a library of known RNA templates; the physical-based prediction method reduces the dependence on the database, but still has the problem that the structural modeling precision is not high enough, and cannot meet the current structural prediction requirement. Thus, for this current situation, we need to innovate the existing method.

In the protein domain, there is a hypothesis that the native conformation of a macromolecule has the lowest free energy, and that the free energy function approximates the sum of hydrogen bonds, van der Waals forces, electrostatic forces, and solvation terms. However, the results obtained by applying the method of protein research to RNA research are poor due to the different folding modes of proteins and RNA molecules. Therefore, we still assume that the macromolecular native conformation has the lowest free energy, but assign different weights to different tertiary interactions, and linearly add to obtain the free energy, against the drawbacks of the prior art. In addition, aiming at the limitation of the single-thread conformational ability, a parallel mechanism is adopted, and meanwhile, multiple judgment is carried out on a modeling result, so that a gradual Monte Carlo parallelization method (SMCP) which is a method specially used for predicting the RNA tertiary structure is obtained.

Disclosure of Invention

Aiming at the defects of the existing RNA structure prediction method, the invention provides an RNA tertiary structure prediction method SMCP based on parallel and Monte Carlo strategies. The SMCP increases the breadth and depth of conformational sampling through a parallel mechanism, screens intermediate results through multiple potential energy judgment, increases the integrity of the results through result judgment and improves modeling accuracy. The method aims to solve the defects of single line Cheng Gou image sampling and the problems of low modeling integrity and precision in the current RNA structure prediction method.

The RNA tertiary structure prediction method based on parallel and Monte Carlo strategies comprises the following steps:

(1) Initializing an RNA motif, and determining the parallel stroke number n and the Monte Carlo times m; n is a natural number greater than 1, m is a natural number of 200-50000;

(2) Constructing a conformational space for the RNA motif, performing efficient conformational sampling on the conformational space by using a parallel mechanism and a 'Stepwise ansatz' hypothesis, performing operations such as adding, deleting, combining, resampling and the like on single nucleotide, and performing multiple random operations to obtain a candidate conformational set;

(3) Calculating potential energy value of the candidate conformation obtained in the step (2) by using an energy function, wherein the biomolecule potential energy value is approximate to Rosetta energy function value, and Rosetta energy value is delta E according to a formula _total ＝∑ _i ω _i E _i (Θ _i ,aa _i ) Calculating the linear sum of all energy terms scaled by weight, wherein E _i Is the energy term, ω _i Is the weight of each energy term, Θ _i Is the geometrical degree of freedom aa _i Is a chemical identity; in addition, the calculation process needs to be based on a connection weight formulaTo calculate potential energy E of each energy term _x Wherein E is _x Is the potential energy value of the energy term x;

(4) Judging the conformational potential energy value obtained in the step (3), wherein after the random operation of the step (2), the change of the potential energy value determines whether the operations of adding, deleting, conformational merging, resampling and the like of the nucleotide can be accepted or not; according to the standard:

determining whether random manipulation of the nucleotide is acceptable, wherein the metapolis criterion is defined by the formula:representing, obtaining a real candidate conformation set after preliminary potential energy judgment;

(5) Further potential energy judgment is carried out on the candidate conformation set obtained in the step (4), and the conformation structure with a low potential energy value is more stable, so that the conformation with the lowest potential energy value is selected as the current best candidate conformation by integrating all threads;

(6) Performing precision calculation on the current best candidate conformation obtained in the step (5), wherein RMSD is an important index for describing structural similarity of two conformations of a molecule; according to the formulaTo calculate RMSD, thereby describing modeling accuracy, wherein +.>Is the distance between atom j and the reference conformation or the average position of m equivalent atoms; in addition, rigid stacking is typically performed to minimize RMSD and then return the minimum value as the final precision value. According to the formula

Calculation, wherein n, v represent given two points;

(7) Performing accuracy judgment on the current best candidate conformation obtained in the step (5-6); we consider that the predicted conformation is in agreement with the experimentally determined conformational errorWithin this, the predicted conformation is the native conformation (i.e. modeling accuracy is required +.>) The method comprises the steps of carrying out a first treatment on the surface of the Therefore, the judgment is performed:

(8) Carrying out integrity judgment on the current best candidate conformation obtained in the step (7), and judging

(9) The conformation obtained in the step (8) is a high-precision high-integrity conformation, a final modeling result is obtained, visual analysis is carried out by using UCSF Chimer, and comparison analysis can be carried out on the conformation measured through experiments and the conformation predicted by the RNA tertiary structure prediction method through the UCSF Chimer.

Preferably, n in the step (1) has a value of 3, and m has a value of 10000;

preferably, the step (3) of obtaining conformational potential energy value comprises the steps of:

(1-1) calculating the energy of the atomic pair interactions. The atomic pair inter/intra interactions include: van der waals forces, electrostatic forces, solvation terms, hydrogen bonding forces, disulfide bonding forces. Energy terms embodying atomic pair interactions include: fa_rep, fa_intra_rep, fa_atr, fa_elec, fa_sol, lk_ball_wtd, hbond_sc, dslf_fal3, hbond_lr_bb, hbond_sr_bb, hbond_bb_sc;

(1-2) calculating the energy related to the torsion of the protein backbone and the side chains. The term indicating the torsion angle is: the pull-type diagram, the backbone design term, and the side chain conformation, the relevant energy terms include: rama_prepro, p_aa_pp, fa_ dun;

(1-3) calculating the energy of the torsion term (peptide bond dihedral angle) under special conditions. Related energy terms include omega, pro_close,

yhh_plannarity；

(1-4) calculating the energy of the non-ideal bond length and angle (Cartesian product bond energy). The relevant energy terms include: a cart_bound;

(1-5) the energy terms of all energy functions under the Rosetta framework are the same, and the difference between different energy functions is the difference of the weight values of the energy terms;

according to formula E _total ＝ω _{fa_rep} E _{fa_rep} +ω _{fa_intra_rep} E _{fa_intra_rep} +ω _{fa_atr} E _{fa_atr} +ω _{fa_} _elec E _{fa_elec} E _{fa_elec} +ω _{fa_sol} E _{fa_sol} +ω _{lk_ball_wtd} E _{lk_ball_wtd} +ω _{hbond_sc} E _{hbond_sc} +ω _{dslf_fal3} E _{dslf_fal3} +ω _{hbond_lr_bb} E _{hbond_lr_bb} +ω _{hbond_sr_bb} E _{hbond_sr_bb} +ω _{hbond_bb_sc} E _{hbond_bb_sc} +ω _{rama_prepro} E _{rama_prepro} +ω _{p_aa_pp} E _{p_aa_pp} +ω _{fa_dun} E _{fa_dun} +ω _omega E _omega +ω _{pro_close} E _{pro_close} +ω _{yhh_plannarity} E _{yhh_plammarity} +ω _{cart_bonded} E _{cart_bonded} And calculating the weighted sum of all energy items in the steps, wherein ωx is the weight of the energy item x, and obtaining the potential energy value of the candidate conformation after calculation.

Preferably, step (4) is centered on calculating the system energy change Δe. The Metropolis criterion described in step (4) is according to the formulaTo determine an acceptance criterion. Where df is the difference in fitness between the new conformation and the original conformation, i.e. df=f (new) -f (old); t is a control parameter of the annealing process.

Compared with the prior art, the method has the beneficial effects that:

the algorithm innovates the RNA tertiary structure prediction algorithm, and realizes efficient structure prediction. The algorithm is based on "Stepwise ansatz"

It is assumed that by manipulating a single nucleotide, the need to enumerate all conformations at once is avoided; the structure is predicted by randomly sampling the conformation added with single nucleotide, so that the modeling stage without depending on fragments or coarse granularity is realized, the calculated amount is reduced, and the modeling time is saved; and the algorithm is optimized by utilizing parallelization, multiple programs are operated simultaneously, the prediction precision and modeling integrity are improved by screening layer by layer according to the energy value, and the modeling time is saved.

Drawings

FIG. 1 is a schematic diagram of a parallel mechanism;

FIG. 2 is a flow chart of the SMCP method;

FIG. 3 is an example of predicting RNA tertiary structure using the SMCP method;

FIG. 4 is a graph comparing modeling accuracy results of predicting RNA tertiary structure using the SMCP method and SWM method under Rosetta framework.

Detailed Description

In order to clearly illustrate the technical solution of the present invention, the present invention is described below with reference to the accompanying drawings (1-3) and examples, which are provided herein for the purpose of illustrating the present invention only and are not limiting.

Fig. 1 shows a schematic diagram of serial sampling and parallel sampling. When a serial sampling method is adopted, random search is started from s to perform conformational sampling, and the position of the local minimum energy can be found through a Monte Carlo mechanism; however, the searchability of single-threaded is limited, it is difficult to find the true lowest energy across energy barriers, and the lowest potential of the conformation obtained by single-threaded conformational search may be pseudo-lowest potential, resulting in low prediction accuracy of the RNA tertiary structure prediction method. When the parallel sampling method is adopted, a plurality of threads start to randomly search the same conformational space at different initial positions s, and all threads can obtain a local minimum energy valley; and the local conformation samples obtained by sampling all threads are comprehensively processed, so that the probability of obtaining the actual lowest energy valley in the conformation space is increased, and the high-quality samples are obtained, thereby improving the prediction precision.

FIG. 2 shows the steps of the flow of the SMCP method for predicting RNA tertiary structure. An example of a selected RNA motif is l1_sam_ll_riboswitch (PDB number: 2QWY, motif length: 7, sequence: GCAGUCG). The input of the SMPP method is provided with two 3D structure files in the pdb format, one is the initial conformation of the l1_sam_ll_riboswitch motif, and the SMPP method is modeled on the basis of the structure; the other is the native conformation of the l1_sam_ll_riboswitch motif, i.e. the experimentally determined structure, compared with the structure predicted by the SMCP method for prediction accuracy analysis of the structure prediction method. In addition, 1 fasta sequence file, 1 flag command operation file, and the number of specified threads n=3, and the number of monte carlo times m=10000 are also required to be input. The output of the SMCP method is the RNA tertiary structure predicted by the method and the structure prediction precision. The following is a specific step of RNA tertiary structure prediction:

1. conformational sampling

The parallel mechanism and the 'Stepwise ansatz' are used to assume that the conformational space is subjected to efficient conformational sampling (the conformational space contains 7 nucleotides: GCAGUCG), on the premise of knowing the structure of the GCUCG, random operations such as adding, deleting, resampling and the like are performed on the nucleotides A and G, and a candidate conformational set is obtained through 10000 times of random Monte Carlo operations, and the sampling process is as follows (only by way of example).

Resampling success operation (9904 th sample is taken as an example):

(1) Modeling 1-2 4-7

(2) Modeling mobile nucleotide No. 4G linked to nucleotide No. 5U

(3) RMSD 1.512 (atom 23 of nucleotide G4) superimposed on atom 86 of nucleotide 1-2 5-7 (RMSD 0.0000007)

(4) Number of attempts: 10000, the successful times are 13;

resampling failure operation (9999 th sampling for example):

(1) Modeling 1-3 5-7

(2) Modeling mobile nucleotide 3A linked to nucleotide 2C

(3) RMSD 3.536 (22 atoms of nucleotide A No. 3) superimposed on the 86 atom of nucleotide 1-2 5-7 (RMSD 0.0000005)

(4) Number of attempts: 3092, number of successes is 20;

the addition of a failed operation (9998 th sampling for example) is standard consistent with resampling, so we just take the failed example:

(1) Modeling 1-3 5-7;

(2) When nucleotide G No. 4 is added, it is linked to nucleotide A No. 3;

(3) RMSD 5.777 (atom No. 27 of nucleotide a 3), superimposed to atom No. 86 of the other nucleotides (RMSD 0.0000008);

(4) Number of attempted addition positions: 100000, times of success, 17;

the delete failure operation (10000 samples are taken as an example), the standard is consistent with resampling, so only one example of failure is given:

(1) Modeling 1-3 5-7;

(2) Deleting nucleotide No. 3 a linked to nucleotide No. 2C;

(3) RMSD0.000, superimposed to atom number 86 of the other nucleotides (RMSD 0.0000003);

2. scoring of energy functions

The potential energy value of the candidate conformation obtained by sampling is calculated by utilizing an energy function, the potential energy value of the biological molecule is approximate to the Rosetta energy function value, and the Rosetta energy value is delta E according to the formula _total ＝∑ _i ω _i E _i (Θ _i ,aa _i ) Calculating the linear sum of all energy terms scaled by weight, wherein E _i Is the energy term, ω _i Is the weight of each energy term, Θ _i Is the geometrical degree of freedom aa _i Is a chemical identity; in addition, the calculation process needs to be based on a connection weight formulaTo calculate potential energy E of each energy term _x Wherein E is _x Is the potential energy value of the x energy term, and the calculation process comprises the following steps:

(1-3) calculating the energy of the torsion term (peptide bond dihedral angle) under special conditions. Related energy terms include omega, pro_close, yhh _planness;

(1-5) the energy terms of all energy functions under the Rosetta framework are the same, and the difference between different energy functions is the difference of the weight values of the energy terms; and calculating the weighted sum of all energy items in the steps to obtain potential energy values of candidate conformations.

After calculation of the energy function, the potential energy of different random operations is changed as follows:

resampling successful operating potential energy value change (9904 th sample is taken as an example): -5.247→ -7.460, potential energy value decrease (initial structural potential energy value: -5.247);

resampling failure operating potential energy value change (9999 th sampling for example): -2.184 → -3.170 potential energy value decrease (initial structural potential energy value: -2.184);

add failed operating potential value change (9998 th sample for example): 17.900 to-1.671, potential energy value decrease (initial structural potential energy value: 15.326);

deletion failure operation potential value change (taking 10000 th sampling as an example): 3.702 → -3.702, the potential energy value is unchanged (initial structural potential energy value: -3.702);

3. potential energy evaluation further determines conformation

The core of the potential energy judgment is to calculate the energy change delta E of the system. The change of potential energy value determines whether the operations of adding and deleting nucleotide, combining conformation, resampling and the like can be accepted;

wherein the metapolis standard exploits the concept of monte carlo. At the rise of energy, a random number α between 0 and 1 is generated and compared with exp (ΔE/kT), if α>exp (-DELTAE/kT) refuses the acceptance; otherwise, the method is accepted, and a real candidate conformation set is obtained.

And judging each operation, and finally selecting to accept or reject, wherein the potential energy judging process is as follows:

resampling success operation (9904 th sample is taken as an example):

(1) The inverse operation of resampling nucleotide G4 linked to nucleotide U5 is performed: resampling nucleotides 1-2 5-7 attached to nucleotide No. 4G;

(2) After execution, modeling is 1-2 4-7;

(3) Potential energy value change: -6.82358-7.46016, potential energy value is reduced;

(4) Is the monte carlo operation accepted? Acceptance (both original and reverse potential values reduced);

resampling failure operation (9999 th sampling for example):

(1) Performing the inverse of nucleotide a resampling number 3 linked to nucleotide C number 2: resampling nucleotide No. 3 a linked to nucleotide No. 2C;

(2) After execution, modeling is 1-3 5-7;

(3) Potential energy value change: -6.33765-3.16991, potential energy value increases;

(4) Is the monte carlo operation accepted? Refusal (original operating potential value decreases, reverse operating potential value increases);

add failure operation (9998 th sample, for example):

(1) Performing the reverse operation to delete nucleotide No. 4G linked to nucleotide No. 3 a;

(2) Modeling 1-7 after deleting;

(3) Potential energy value change: -6.33765-1.67202, potential energy value increases;

delete failure operation (taking sample 10000 as an example):

(1) Performing the reverse operation, adding nucleotide No. 3 a linked to nucleotide No. 2C;

(2) After execution, modeling is 1-2 5-7;

(3) Potential energy value change: -6.33765 → -3.70202 potential energy value decrease (initial structural potential energy value: -3.702, no change from initial value);

(4) Is the monte carlo operation accepted? Refusing (original operation potential value is unchanged, the reverse operation potential value is reduced, but the reverse operation potential value is consistent with the initial value);

4. multithreading comprehensive potential energy judgment

The conformation structure with the low potential energy value is more stable, so that all threads are synthesized to select the conformation with the lowest potential energy value as the current best candidate conformation. The conformational potential values obtained for the 3 threads made by l1_sam_ll_riboswitch motif are respectively: -11.133REU (REU: rosetta Energy Units), -10.123REU, -12.155REU, according to the principle: the lower the potential energy value of the structure, the more stable the structure, and the conformation with the lowest potential energy value, namely the conformation with the potential energy value of-12.155 REU, is selected.

5. Modeling accuracy calculation

RMSD is an important indicator describing the structural similarity of two conformations of a molecule; according to basic formulaTo calculate RMSD to describe modeling accuracy, where δ _i Is the distance between atom i and the reference conformation or the average position of n equivalent atoms; a rigid overlay is typically performed during the calculation to minimize RMSD and then return this minimum value as the final precision value RMSD. At this time, it is required to use the formulaTo calculate RMSD where n, v represent given two points. The modeling precision of the l1_sam_ll_riboswitch die body is obtained by the calculation formula>Its potential energy value is-12.155 REU.

6. Modeling accuracy and integrity determination and processing

Performing accuracy judgment on the current optimal candidate conformation; judgingThe current best conformation modeling accuracy isThe precision requirement can be met, and the re-modeling is not needed;

integrity judgment is carried out on the current best candidate conformation, and judgment is carried out

The missing value of the current conformation is 0, which indicates that the SMCP method has completed complete modeling of the 7 nucleotides of GCAGUCG;

after 10000 modeling passes on l1_sam_ll_riboswitch motif are completed, the statistics related to the monte carlo random operation are as follows:

(1) The number of times of addition: 1095; acceptance rate: 0.2868;

(2) Number of deletions: 3968; acceptance rate: 0.0769;

(3) Number of resampling: 4937; acceptance rate: 0.4588;

and (3) adding the A and G nucleotides to the 24 th and 25 th positions on the A chain through random Monte Carlo parallelization sampling, and judging and processing the precision and the integrity, wherein the conformation is the final modeling result.

7. Conformational visualization analysis

The high-precision high-integrity structure obtained after modeling is subjected to visual analysis by using UCSF (unified control system) Chimer, and the conformation measured by an experiment and the conformation predicted by an SMCP (surface-controlled processing) method can be subjected to contrast analysis by using the UCSF Chimer. The comparison result is shown in fig. 3, wherein the diagram a is the experimental measurement structure of the l1_sam_ll_riboswitch motif, the diagram B is the structure of the l1_sam_ll_riboswitch motif predicted by the SMCP method, and as can be seen from the diagrams a and B in fig. 3, the RNA structure predicted by the SMCP method has extremely high similarity with the experimental measurement real structure; from the modeling result data, the RMSD of the SMCP method modeling the l1_sam_ll_riboswitch motif isWhereas the RMSD obtained by predicting SWM by the current best method using RNA tertiary structure under Rosetta frame is +.>The SMCP method is higher in accuracy than the SWM method in predicting the RNA tertiary structure.

Modeling a benchmark consisting of 9 RNAs using the SMCP and Rosetta framework SWM method, fig. 4 is a graph comparing RMSD results of three-level structure modeling of the benchmark using the SMCP method with the SWM method, where the abscissa is RMSD of three-level structure of RNA predicted using the SWM method, the ordinate is RMSD of three-level structure of RNA predicted using the SMCP method, and each dot/square in the graph represents one RNA motif.

As can be seen from fig. 4, when the structure prediction is performed on 9 RNA motifs in the reference, the RMSD value obtained by modeling by the SMCP method is lower, that is, the modeling precision is higher, which indicates that the SMCP method takes a dominant role in the high-precision modeling field of the RNA tertiary structure, and the prediction precision is higher when the SMCP method predicts the RNA tertiary structure;

in FIG. 4, 2 RNA motifs with black square marks exist, and when RNA tertiary structure prediction is performed by using the SWM method, complete modeling of the 2 RNA motifs cannot be realized; whereas complete modeling of all nucleotides in these 2 RNA motifs can be achieved using the SMCP method, indicating that the SMCP method predicts higher integrity of the prediction when predicting RNA tertiary structure.

Claims

1. The RNA tertiary structure prediction method based on the parallel and Monte Carlo strategies is characterized by comprising the following steps:

(2) Constructing a conformational space for the RNA motif, performing efficient conformational sampling on the conformational space by using a parallel mechanism and a 'Stepwise ansatz' hypothesis, performing single nucleotide addition, deletion, merging and resampling operation, and performing multiple random operations to obtain a candidate conformational set;

(3) Calculating potential energy value of the candidate conformation obtained in the step (2) by using an energy function, wherein the biomolecule potential energy value is approximate to Rosetta energy function value, and Rosetta energy value is according to a formulaCalculating the linear sum of all energy terms scaled by weight, wherein +.>Is an energy item, +.>Is the weight of each energy term, +.>Is a degree of freedom of the geometry,is a chemical identity; in addition, the calculation process needs to be carried out according to the connection weight formula +.>To calculate potential energy of each energy item;

the step (3) of obtaining conformational potential energy value comprises the following steps:

(1-1) calculating energy of atomic pair interactions: the atomic pair inter/intra interactions include: van der waals forces, electrostatic forces, solvation terms, hydrogen bonding forces, disulfide bonding forces; energy terms embodying atomic pair interactions include:，/>，，/>，/>，/>，/>，/>，/>，，/>；

(1-2) calculating the energy related to protein backbone and side chain torsion: the term indicating the torsion angle is: the pull-type diagram, the backbone design term, and the side chain conformation, the relevant energy terms include:，/>，/>；

(1-3) calculating the energy of the dihedral angle of the peptide bond of the torsion term: the relevant energy term includes:，/>，；

(1-4) calculating the energy of the non-ideal bond length and the angular Cartesian product bond energy: the relevant energy terms include:；

(1-5) the energy terms of all energy functions under the Rosetta framework are the same, and the difference between different energy functions is the difference of the weight values of the energy terms; according to the formula

Calculating the weighted sum of all energy items in the steps;

(4) Judging the conformational potential energy value obtained in the step (3), wherein after the random operation in the step (2), the change of the potential energy value determines whether the operations of adding and deleting nucleotides, conformational merging and resampling can be accepted or not; according to the standard:determining whether a random manipulation of the nucleotide is acceptable, wherein the metapolis criterion is defined by the formula:representing, obtaining a real candidate conformation set after preliminary potential energy judgment; alpha is a random number between 0 and 1;

(6) Performing precision calculation on the current best candidate conformation obtained in the step (5), wherein RMSD is an important index for describing structural similarity of two conformations of a molecule; according to the formulaTo calculate RMSD, thereby describing modeling accuracy, wherein +.>Is the distance between atom j and the average position of m equivalent atoms; in addition, a rigid overlay would be performed to minimize RMSD and then return the minimum as the final precision value; according to the formulaCalculation of>，/>Representing a given two points;

(7) Performing accuracy judgment on the current best candidate conformation obtained in the steps (5) - (6); when the predicted conformation and the experimentally determined conformation error are within 2 a, the predicted conformation is the native conformation, i.e., the modeling accuracy is required for RMSD2Å；

Therefore, the judgment is performed:；

(8) Carrying out integrity judgment on the current best candidate conformation obtained in the step (7), and judging；

(9) The conformation obtained in the step (8) is a conformation with high precision and high integrity, and the predicted final RNA tertiary structure is obtained.

2. The method for predicting the three-level structure of RNA based on the parallel and Monte Carlo strategies according to claim 1, wherein n in the step (1) has a value of 3 and m has a value of 10000.