CN101717822A - RNA sequence secondary structure prediction method based on base fragment coding and ant colony optimization - Google Patents

RNA sequence secondary structure prediction method based on base fragment coding and ant colony optimization Download PDF

Info

Publication number
CN101717822A
CN101717822A CN200910218023A CN200910218023A CN101717822A CN 101717822 A CN101717822 A CN 101717822A CN 200910218023 A CN200910218023 A CN 200910218023A CN 200910218023 A CN200910218023 A CN 200910218023A CN 101717822 A CN101717822 A CN 101717822A
Authority
CN
China
Prior art keywords
rna sequence
secondary structure
coding
prediction method
structure prediction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN200910218023A
Other languages
Chinese (zh)
Inventor
刘元宁
余军
王刚
张�浩
高娜
张晓旭
王俊勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jilin University
Original Assignee
Jilin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jilin University filed Critical Jilin University
Priority to CN200910218023A priority Critical patent/CN101717822A/en
Publication of CN101717822A publication Critical patent/CN101717822A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention relates to a RNA sequence secondary structure prediction method based on base fragment coding and ant colony optimization, belonging to the field of bioinformatics research. The RNA sequence secondary structure prediction method comprises the steps of: recoding an RNA sequence, and obtaining a corresponding coding sequence according to corresponding values in a coding table and a coding association list; eliminating a redundancy stem region by a strategy of extending rightwards according to an exact matching table and a partial matching table to obtain all possible useable stem region sets; giving two-dimensional heuristic information in the ant colony optimization, selection rule of an initial stem region and the next stem region and updating strategy of pheromones to construct compatible subsets of all possible stem region sets; and finally obtaining a secondary structure with minimum free energy. The invention can rapidly, accurately and effectively predict the secondary structure without including a pseudojnot RNA sequence, can output the obtained result in a bracket mode way, and is superior to the main prediction technology at present in the aspects of sensitivity and specificity of the prediction of the RNA sequence second structure.

Description

RNA sequence secondary structure prediction method based on base fragment coding and ant group algorithm
Technical field
The invention belongs to the information biology research field.
Background technology
Studies show that RNA has played important effect in gene regulating, and the function of RNA and structure are closely-related, therefore the functional performance of wanting to understand the RNA sequence should be started with from its structure earlier.Obtained at present a large amount of RNA sequence primary structure information, increasing researchist begins to pay close attention to the secondary structure and the tertiary structure of RNA sequence, but determine that with the method for Bioexperiment its tertiary structure cost is high, difficulty is big, and is not all effective to all molecules.The tertiary structure of RNA sequence is difficult to directly obtain by primary structure simultaneously, and directly at the theoretical prediction of its tertiary structure, progress is not very smooth, and the prediction secondary structure is to obtain the only way which must be passed of tertiary structure.Therefore by the secondary structure of software simulation with prediction RNA sequence, and combine with embedded system based on the ARM system, form embedded RNA sequence secondary structure prediction method, with lower cost and faster the time obtain to have certain confidence level result's mode, become the important means of information biology.
Summary of the invention
The object of the present invention is to provide a kind of can be fast, prediction does not comprise false knot accurately and efficiently RNA sequence secondary structure, and with the method for gained result with the output of bracket method pattern.
The present invention includes the following step:
1. carry out the pre-treatment of RNA sequence, specifically comprise the following steps:
1) with RNA sequence input CPLD.
2) by the coding contingency table with the RNA sequence with the coding form be stored among the SRAM, the coding contingency table be present in the system, but and real-time calling, it is converted to the figure pattern that is easy to systems analysis with the RNA sequence of being obtained.
2. carry out RNA sequence secondary structure prediction, specifically comprise the following steps:
1) obtain the stem district set that length is n according to matching list, matching list is present in the system, but and real-time calling, it is with switched RNA sequence information, being combined as length is the stem district set of n.
2) be that the stem district of n adopts the strategy that extends to the right to all length, obtain the stem district set of all length greater than n; The strategy of Yan Shening can calculate the stem district set of all length greater than n to the right.
3) will draw corresponding to the set of all possible stem district of RNA sequence, be stored in and wait among the SDRAM and calling;
4) utilize a certain stem of ARM control chip picked at random district, as the initial node of ant group algorithm;
5) utilize next stem district of policy selection of roulette, up to the set of selectable stem district for empty;
6) minimum free energy of every corresponding secondary structure of ant of calculating, the secondary structure of record and renewal energy minimum;
7), and choose initial node once more and carry out loop computation according to the pheromone value between the given Policy Updates stem district;
8) reach given iteration step number or the satisfied condition that circulates and withdraw from, the secondary structure of exporting the RNA sequence with the bracket method pattern is to LCD, and the iteration step number can be provided with in system in advance.
The present invention can effectively simulate and predict the RNA sequence secondary structure that does not comprise false knot.The present invention carries out recompile by the base sequence with typing, and then obtain corresponding coding sequence by the respective value of coding in the contingency table, and according to complete matching list and incomplete matching list, reject redundant stem district by extension function to the right, obtain the set of all possible useful stem district, according to the selection rule and the pheromone update strategy in the heuristic information of two dimension in the ant group algorithm, initial stem district and next stem district, construct the compatible subclass of all possible stem district set then.
The present invention uses concurrent technique that the secondary structure of RNA sequence is predicted more fast and accurately, can access the compatible stem district set of free energy minimum, the present invention can also test and analyze the sequence of picked at random in the international public database, and the gained result can be exported with the bracket method pattern, make structure representation more directly perceived accurately, experimental result shows that the present invention all is being better than present main flow forecasting techniques aspect susceptibility and the specificity.
Description of drawings
Fig. 1 is the RNA sequence secondary structure prediction method flow diagram based on base fragment coding and ant group algorithm
Fig. 2 is the RNA sequence secondary structure prediction system architecture synoptic diagram based on base fragment coding and ant group algorithm
Embodiment
The present invention is a kind of RNA sequence secondary structure prediction method based on base fragment coding and ant group algorithm, as shown in Figure 1, with gained RNA sequence input CPLD, by the coding contingency table RNA sequence is encoded, the RNA sequence is stored among the SRAM with the form of encoding sequence, and obtain the stem district set that length is n according to matching list, to all length is that the stem district of n adopts the strategy that extends to obtain the stem district set of all length greater than n to the right, being stored in corresponding to the set of all possible stem district of RNA sequence of drawing waited among the SDRAM and being called, then by the initial node of a certain stem of ARM control chip picked at random district as ant group algorithm, and utilize next stem district of policy selection of roulette, be sky until the set of selectable stem district, calculate the minimum free energy of every corresponding secondary structure of ant at last, the secondary structure of record and renewal energy minimum, according to the pheromone value between the given Policy Updates stem district, and choose initial node once more and carry out loop computation, until reaching given iteration step number or satisfying the condition that circulation is withdrawed from, export RNA sequence secondary structure in LCD with the pattern of bracket method.

Claims (7)

1. the RNA sequence secondary structure prediction method based on base fragment coding and ant group algorithm is characterized in that comprising the following steps:
1) carries out the pre-treatment of RNA sequence;
2) carry out RNA sequence secondary structure prediction.
2. by the described RNA sequence secondary structure prediction method of claim 1, it is characterized in that the pre-treatment of the described RNA sequence of step 1) comprises the following steps: based on base fragment coding and ant group algorithm
1) with RNA sequence input CPLD;
2) by the coding contingency table form of RNA sequence with coding is stored among the SRAM.
3. by the described RNA sequence secondary structure prediction method of claim 1, it is characterized in that step 2 based on base fragment coding and ant group algorithm) described RNA sequence secondary structure prediction comprises the following steps:
1) obtains the stem district set that length is n according to matching list;
2) be that the stem district of n adopts the strategy that extends to the right to all length, obtain the stem district set of all length greater than n;
3) will draw corresponding to the set of all possible stem district of RNA sequence, be stored in and wait among the SDRAM and calling;
4) utilize a certain stem of ARM control chip picked at random district, as the initial node of ant group algorithm;
5) utilize next stem district of policy selection of roulette, up to the set of selectable stem district for empty;
6) minimum free energy of every corresponding secondary structure of ant of calculating, the secondary structure of record and renewal energy minimum;
7), and choose initial node once more and carry out loop computation according to the pheromone value between the given Policy Updates stem district;
8) reach given iteration step number or the satisfied condition that circulates and withdraw from, the secondary structure of exporting the RNA sequence with the bracket method pattern is to LCD.
4. by the described RNA sequence secondary structure prediction method of claim 2 based on base fragment coding and ant group algorithm, it is characterized in that step 2) described coding contingency table is present in the system, but and real-time calling, it is converted to the figure pattern that is easy to systems analysis with the RNA sequence of being obtained.
5. by the described RNA sequence secondary structure prediction method of claim 3 based on base fragment coding and ant group algorithm, it is characterized in that the described matching list of step 1) is present in the system, but and real-time calling, it is switched RNA sequence information, being combined as length is the stem district set of n.
6. by the described RNA sequence secondary structure prediction method of claim 3, it is characterized in that step 2 based on base fragment coding and ant group algorithm) the described strategy that extends to the right, can calculate the stem district set of all length greater than n.
7. by the described RNA sequence secondary structure prediction method of claim 3, it is characterized in that the described iteration step number of step 8) can be provided with in advance in system based on base fragment coding and ant group algorithm.
CN200910218023A 2009-12-16 2009-12-16 RNA sequence secondary structure prediction method based on base fragment coding and ant colony optimization Pending CN101717822A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN200910218023A CN101717822A (en) 2009-12-16 2009-12-16 RNA sequence secondary structure prediction method based on base fragment coding and ant colony optimization

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200910218023A CN101717822A (en) 2009-12-16 2009-12-16 RNA sequence secondary structure prediction method based on base fragment coding and ant colony optimization

Publications (1)

Publication Number Publication Date
CN101717822A true CN101717822A (en) 2010-06-02

Family

ID=42432461

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200910218023A Pending CN101717822A (en) 2009-12-16 2009-12-16 RNA sequence secondary structure prediction method based on base fragment coding and ant colony optimization

Country Status (1)

Country Link
CN (1) CN101717822A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102880811A (en) * 2012-10-24 2013-01-16 吉林大学 Method for predicting secondary structure of ribonucleic acid (RNA) sequence based on complex programmable logic device (CPLD) base fragment encoding and ant colony algorithm
CN103235902A (en) * 2013-04-18 2013-08-07 山东建筑大学 Prediction method for ribose nucleic acid (RNA) structure comprising false knots
CN103593587A (en) * 2013-11-20 2014-02-19 吉林大学 Component-based identification method and device of long-chain RNA secondary structures with pseudo knots
CN109273047A (en) * 2017-12-15 2019-01-25 武汉科技大学 A kind of nucleic acid structure prediction technique based on simulated annealing
CN109599146A (en) * 2018-11-08 2019-04-09 武汉科技大学 A kind of band false knot nucleic acid Structure Prediction Methods based on multi-objective genetic algorithm
CN117116361A (en) * 2023-10-25 2023-11-24 江西师范大学 12sRNA secondary structure visualization method based on fixed frame
CN117497092A (en) * 2024-01-02 2024-02-02 合肥微观纪元数字科技有限公司 RNA structure prediction method and system based on dynamic programming and quantum annealing

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102880811A (en) * 2012-10-24 2013-01-16 吉林大学 Method for predicting secondary structure of ribonucleic acid (RNA) sequence based on complex programmable logic device (CPLD) base fragment encoding and ant colony algorithm
CN103235902A (en) * 2013-04-18 2013-08-07 山东建筑大学 Prediction method for ribose nucleic acid (RNA) structure comprising false knots
CN103235902B (en) * 2013-04-18 2016-03-09 山东建筑大学 Comprise the RNA Structure Prediction Methods of false knot
CN103593587A (en) * 2013-11-20 2014-02-19 吉林大学 Component-based identification method and device of long-chain RNA secondary structures with pseudo knots
CN109273047A (en) * 2017-12-15 2019-01-25 武汉科技大学 A kind of nucleic acid structure prediction technique based on simulated annealing
CN109273047B (en) * 2017-12-15 2022-09-16 武汉科技大学 Nucleic acid structure prediction method based on simulated annealing
CN109599146A (en) * 2018-11-08 2019-04-09 武汉科技大学 A kind of band false knot nucleic acid Structure Prediction Methods based on multi-objective genetic algorithm
CN109599146B (en) * 2018-11-08 2022-04-15 武汉科技大学 Multi-target genetic algorithm-based nucleic acid structure prediction method with false knots
CN117116361A (en) * 2023-10-25 2023-11-24 江西师范大学 12sRNA secondary structure visualization method based on fixed frame
CN117116361B (en) * 2023-10-25 2024-01-26 江西师范大学 12sRNA secondary structure visualization method based on fixed frame
CN117497092A (en) * 2024-01-02 2024-02-02 合肥微观纪元数字科技有限公司 RNA structure prediction method and system based on dynamic programming and quantum annealing
CN117497092B (en) * 2024-01-02 2024-05-14 微观纪元(合肥)量子科技有限公司 RNA structure prediction method and system based on dynamic programming and quantum annealing

Similar Documents

Publication Publication Date Title
CN101717822A (en) RNA sequence secondary structure prediction method based on base fragment coding and ant colony optimization
CN111061569B (en) Heterogeneous multi-core processor task allocation and scheduling strategy based on genetic algorithm
CN102135937B (en) Pairwise overlay integrated software test suite generating method
Chen et al. Improved particle swarm optimization-based form-finding method for suspension bridge installation analysis
Jiang et al. BP Neural Network Could Help Improve Pre‐miRNA Identification in Various Species
Liu et al. Prediction of protein–protein interactions based on PseAA composition and hybrid feature selection
CN106650305B (en) A kind of more tactful group Advances in protein structure prediction based on local abstract convex supporting surface
CN102880811A (en) Method for predicting secondary structure of ribonucleic acid (RNA) sequence based on complex programmable logic device (CPLD) base fragment encoding and ant colony algorithm
CN102193830A (en) Many-core environment-oriented division mapping/reduction parallel programming model
Yones et al. High precision in microRNA prediction: A novel genome-wide approach with convolutional deep residual networks
Chang et al. Quantitative inference of dynamic regulatory pathways via microarray data
Zhao et al. An interpretable ensemble-learning-based open source model for evaluating the fire resistance of concrete-filled steel tubular columns
Kamenetzky et al. MicroRNA discovery in the human parasite Echinococcus multilocularis from genome-wide data
CN102722570A (en) Artificial immunity intelligent optimization system facing geographical space optimization
CN110889250A (en) Steel truss structure damage identification method based on mixed element heuristic algorithm
CN102254225B (en) Evolvable hardware implementation method based on trend-type compact genetic algorithm
Takasaki Methods for selecting effective siRNA target sequences using a variety of statistical and analytical techniques
JP2012502340A (en) Simulating processor execution with branch override
CN109599146A (en) A kind of band false knot nucleic acid Structure Prediction Methods based on multi-objective genetic algorithm
CN102222274A (en) Immune clone selection job shop scheduling method based on scheduling coding
KR20220111215A (en) Apparatus and method for predicting drug-target interaction using deep neural network model based on self-attention
Zhang et al. Multiple Sequence Alignment based on deep Q network with negative feedback policy
Chen et al. PmliHFM: Predicting Plant miRNA-lncRNA Interactions with Hybrid Feature Mining Network
CN114582420A (en) Transcription factor binding site prediction method and system based on fault-tolerant coding and multi-scale dense connection network
Behrmann et al. UPPAAL-Tiga: Timed games for everyone

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20100602