CN101281561A - Method for quantitative analyzing evolution of RNA structure steadiness - Google Patents

Method for quantitative analyzing evolution of RNA structure steadiness Download PDF

Info

Publication number
CN101281561A
CN101281561A CNA2008101115100A CN200810111510A CN101281561A CN 101281561 A CN101281561 A CN 101281561A CN A2008101115100 A CNA2008101115100 A CN A2008101115100A CN 200810111510 A CN200810111510 A CN 200810111510A CN 101281561 A CN101281561 A CN 101281561A
Authority
CN
China
Prior art keywords
rna
sequence
evolution
steadiness
rna sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2008101115100A
Other languages
Chinese (zh)
Other versions
CN100559381C (en
Inventor
王升启
舒文杰
伯晓晨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Radiation Medicine of CAMMS
Original Assignee
Institute of Radiation Medicine of CAMMS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Radiation Medicine of CAMMS filed Critical Institute of Radiation Medicine of CAMMS
Priority to CNB2008101115100A priority Critical patent/CN100559381C/en
Publication of CN101281561A publication Critical patent/CN101281561A/en
Application granted granted Critical
Publication of CN100559381C publication Critical patent/CN100559381C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention relates to a computer program, more specifically to a method for quantitatively analyzing the evolution of RNA structural robustness. The invention aims to provide a quantitative analytical method capable of analyzing RNA structural robustness simply, conveniently and quickly, solves the problem that RNA structural robustness is difficult to evaluate and quantify, reaches the objective of analysis of biological robustness origin and evolution thereof, thus improving the understanding of biological evolution. The invention is based on a RNA secondary structure as the research platform, provides a method for quantitatively analyzing the evolution of RNA structural robustness. The method comprises the steps of checking the validity of a RNA sequence inputted from a computer terminal, generating a control sequence, calculating RNA structural robustness, and quantitatively analyzing the evolution of the RNA structural robustness.

Description

A kind of method of evolution of quantitative analyzing RNA structure steadiness
Technical field
The present invention relates to a kind of computer program, more specifically, is a kind of method of evolution of quantitative analyzing RNA structure steadiness.
Background technology
Biological robustness is the most basic and ubiquitous a kind of phenomenon in the biosystem.Before it is understood to be in various interference surface, still can keep a kind of ability of stabilization function.According to the difference of jamming pattern whether (can heredity), robustness be divided into hereditary robustness and environment robustness.The heredity robustness is meant before the genetic mutation interference surface insensitivity of phenotype; And before the environment robustness is meant the interference surface of environmental factor externally, the insensitivity of phenotype.All the time, the biologist pays special attention to the research of biological robustness, studies the canalization research of Waddington from the dominance of Fisher.Studies show that, on each level of biosystem, all have robustness, comprise that gene expression, protein folding, metabolic flux, health self-care are regulated, growth, even cmot.The origin of robustness and evolution will help our understanding to biological evolution in the correct understanding biosystem.
The RNA secondary structure is a good platform of the biological robustness of research.In fact, there has been Many researchers to study robustness among RNA viruses, viroid and the microRNA.Although there is a lot of research to pay close attention to the evolutionary mechanism of robustness, up to now, the origin of robustness and evolution thereof still are not very clear.Cause the reason of this present situation, mainly owing in biosystem, being difficult to provide the quantitative analysis method that robustness is evolved.
Summary of the invention
The present invention aims to provide a kind of quantitative analysis method that can measure the evolution of RNA structure steadiness simply, quickly and easily, solve difficult, the quantitatively difficult problem of robustness evolution assessment, reach the origin of the biological robustness of analysis and the purpose of evolution thereof, improve understanding therefrom biological evolution.
In order to achieve the above object, the present invention is a research platform with the RNA secondary structure, a kind of method of evolution of quantitative analyzing RNA structure steadiness is provided in computer system, this method comprises legitimacy, generation control sequence, the calculating RNA structure steadiness of inspection from the RNA sequence of terminal input, the step of the evolution of quantitative analyzing RNA structure steadiness.
In a kind of method of evolution of quantitative analyzing RNA structure steadiness, the generation of control sequence is on the basis of selected disorder method, along the hamming distance of the RNA sequence of importing (length is l), adopt monte carlo method stochastic sampling N bar sequence, common property is given birth to l * N bar random series.The present invention has realized five kinds of disorder methods that produce control sequence altogether, specifically describes as follows:
● completely random: produce the random series that has equal length with list entries;
● single base scramble: the position of base in the random permutation sequence;
● double alkali yl scramble:, obtain double alkali yl scramble sequence according to the Erikson-Altschul algorithm.
● based on the scramble of zeroth order Markov model: single base frequency P (b) in the sequence of calculation.According to this frequency in the different base of each site stochastic sampling till the length that reaches list entries;
● based on the scramble of single order Markov model: the conditional probability P (a|b) that given base b base a occurs in the sequence of calculation.Select the base x in first site at random 1, according to conditional probability P (x I+1| x i) select the base x in next site I+1, till the length that reaches list entries;
In a kind of method of evolution of quantitative analyzing RNA structure steadiness, adopt the quantitative test index of neutral value as the RNA structure steadiness, neutral value is defined as
γ = 1 3 × l Σ i = 1 3 × l l - d i l - - - ( 1 )
Wherein, d i, i=1,2 ..., 3 * l is the structure distance between RNA sequence and its i mutant sequence, l is the length of RNA sequence.Neutral value γ is big more, shows that this RNA sequence has the robustness of higher level.Structure between RNA sequence and its mutant sequence is divided into two kinds of situations apart from the calculating of d: under the situation of only considering the minimum free energy structure, d is the RNA sequence of employing different structure metric calculation and the distance of the minimum free energy structure between its mutant sequence, these degree of structuration measurings comprise the string editing distance, tree editing distance and base-pair distance; Under the situation of considering the suboptimum structure, d by the structural entity between RNA sequence and its mutant sequence apart from δ mProvide.Structural entity is apart from δ mBe defined as follows:
δ m ( x , y ) = Σ S , S ′ p x ( S ) p y ( S ′ ) δ ( S , S ′ ) - - - ( 2 )
Wherein, p x(S) be the equilibrium probability of structure S in the structural entity of sequence x, p y(S ') is the equilibrium probability of structure S ' in the structural entity of sequences y, and δ (S, S ') is the distance of structure S and S '.
In a kind of method of evolution of quantitative analyzing RNA structure steadiness, the quantitative test of the evolution of RNA structure steadiness is to carry out along the hamming distance of the RNA sequence of input.Concrete operations are as follows: calculate the RNA sequence of input respectively and the robustness γ of the control sequence that produces along the hamming distance samples and
Figure A20081011151000043
Wherein N is the number of the control sequence of generation on each hamming distance, and l is the length of the RNA sequence of input.Each hamming apart from j on, relatively γ and Υ j, the conspicuousness of the RNA sequence of analysis input robustness on each hamming distance is calculated the last corresponding p-value value of each hamming distance.And provide the curve of RNA structure steadiness with the hamming variable in distance, the i.e. quantitative analysis results of the evolution of RNA structure steadiness thus.Hamming apart from j on, the p-value value defined is
p j = M N + 1 - - - ( 3 )
Wherein, M represents set
Figure A20081011151000052
In, the number of the sequence more sane, i.e. set than the RNA sequence of input
Figure A20081011151000053
In the number of the neutral value bigger than the value of γ.
Description of drawings
Fig. 1 is the The general frame of method of the evolution of a kind of quantitative analyzing RNA structure steadiness of the present invention;
Fig. 2 is the process flow diagram that calculates the evolution of RNA structure steadiness among Fig. 1;
Fig. 3 is the analysis result of the evolution of the structure steadiness of microRNA let-7 in the nematode.
Embodiment
Fig. 1 is the The general frame of the method for the evolution of a kind of quantitative analyzing RNA structure steadiness of the present invention.
To RNA sequence,, do validity checking according to the definition of RNA sequence from the terminal input.The RNA sequence is to take from alphabet A character string R=r 1, r 2..., r n, wherein I=1,2 ..., n.To not meeting the list entries of this definition, then return and re-enter.Adopt the present invention, the example of analysis is that length is the sequence of the microRNA let-7 precursor of l=99 in the nematode:
UACACUGUGGAUCCGGUGAGGUAGUAGGUUGUAUAGUUUGGAAUAUUACCACCGGU
GAACUAUGCAAUUUUCUACCUUACCGGAGACAGAACUCUUCGA
After to the RNA sequence checking legitimacy of importing from terminal, hamming distance along the RNA sequence of importing, the disorder method of the completely random in selected five kinds of disorder methods, adopt monte carlo method stochastic sampling N=1, article 000, random rna sequence, common property is given birth to l * N=99,000 random rna sequence.
To the RNA sequence microRNA let-7 and the last contrast RNA sequence of each hamming distance thereof of input, calculate their structure steadiness, Fig. 2 has provided the process flow diagram that calculates the evolution of RNA structure steadiness.To every RNA sequence, because each site has four bases
Figure A20081011151000056
Available, remove itself, can produce three mutant in each site.For example, to the microRNA let-7 precursor sequence of input, the base U in its first site can sport other three base A, C and U, and such three mutant sequences are:
Site mutation body sequence
AACACUGUGGAUCCGGUGAGGUAGUAGGUUGUAUAGUUUGGAAUAUUACCA
1
CCGGUGAACUAUGCAAUUUUCUACCUUACCGGAGACAGAACUCUUCGA
CACACUGUGGAUCCGGUGAGGUAGUAGGUUGUAUAGUUUGGAAUAUUACCA
1
CCGGUGAACUAUGCAAUUUUCUACCUUACCGGAGACAGAACUCUUCGA
GACACUGUGGAUCCGGUGAGGUAGUAGGUUGUAUAGUUUGGAAUAUUACCA
1
CCGGUGAACUAUGCAAUUUUCUACCUUACCGGAGACAGAACUCUUCGA
Utilize the folding program RNAfold of RNA secondary structure of standard, the RNA sequence of input and three mutant sequences (having 3 * l mutant sequence) in each site thereof are folded into corresponding secondary structure.If only consider the minimum free energy structure, utilize the RNA secondary structure distance metric program RNAdistance of standard, selected distance tolerance (string editing distance, tree editing distance or three kinds of distance metrics of base-pair distance), the RNA sequence of calculating input and the structure between its each mutant sequence are apart from d.If consider the suboptimum structure, then utilize the RNA secondary structure overall distance tolerance program RNApdist of standard, the RNA sequence of calculating input and the structural entity between its each mutant sequence are apart from δ mAfter the structure distance that obtains between RNA sequence and its mutant sequence,, calculate 3 * l mutant sequence
Figure A20081011151000061
I=1,2 ..., the value of 3 * l is added up their mean value, promptly obtains the neutral value γ of defined RNA sequence in (1) formula.
In a kind of method of evolution of quantitative analyzing RNA structure steadiness, according to top flow process, calculate the RNA sequence of input and the robustness γ of the control sequence that produces along the hamming distance samples and
Figure A20081011151000062
Wherein N is the number of the control sequence of the last generation of each hamming distance, and l is the length of the RNA sequence of input.Subsequently, analyze the evolution of RNA structure steadiness.On each hamming distance, according to formula (3), calculate the last corresponding p-value value of each hamming distance, and provide the analysis result of the evolution of RNA structure steadiness thus.What Fig. 3 showed is the analysis result of the evolution of the structure steadiness of microRNA let-7 in the nematode.
The present invention adopts the structure steadiness of neutral value qualitative assessment RNA molecule, can be simply, the quantitative analyzing RNA structure steadiness is along with the evolution of hamming distance quickly and easily, and RNA is evolved has important significance for theories and practical value.

Claims (5)

1. the method for the evolution of a quantitative analyzing RNA structure steadiness is characterized in that described method comprises the following steps:
1) receives the RNA sequence of importing from terminal (length is l), differentiate the legitimacy of this sequence;
2) select disorder method, on each hamming distance, generate corresponding control sequence;
3) according to the definition of neutral value, the RNA sequence of calculating input and each hamming thereof are apart from the neutral value of last control sequence;
4) evolution of quantitative analyzing RNA structure steadiness.
2. the method for the evolution of a kind of quantitative analyzing RNA structure steadiness according to claim 1, wherein said disorder method, it is characterized in that it comprises that completely random, single base scramble, double alkali yl put, amount to five kinds of method of randomization that produce control sequence based on the scramble of zeroth order Markov model with based on the scramble of single order Markov model.
3. the method for the evolution of a kind of quantitative analyzing RNA structure steadiness according to claim 1, wherein said neutral value is characterized in that, the structure in its definition between RNA sequence and its mutant sequence is divided into two kinds of situations apart from the calculating of d:
1) under the situation of only considering the minimum free energy structure, the structure between RNA sequence and its mutant sequence is apart from d, and by the string editing distance of the minimum free energy structure between RNA sequence and the mutant sequence, tree editing distance or base-pair distance provide;
2) under the situation of considering the suboptimum structure, the structure between RNA sequence and its mutant sequence apart from d by the structural entity between RNA sequence and its mutant sequence apart from δ mProvide.
4. the situation of consideration suboptimum structure according to claim 3 is characterized in that, it is meant consideration all suboptimum structures in the minimum free energy 1kcal/mol of RNA sequence of importing and mutant sequence.
5. the method for the evolution of a kind of quantitative analyzing RNA structure steadiness according to claim 1, the evolution of wherein said quantitative analyzing RNA structure steadiness, it is characterized in that its result is provided with the curve of hamming variable in distance by the p-value value of the conspicuousness of the structure steadiness of the RNA sequence of input.
CNB2008101115100A 2008-06-05 2008-06-05 A kind of method of evolution of quantitative analyzing RNA structure steadiness Expired - Fee Related CN100559381C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2008101115100A CN100559381C (en) 2008-06-05 2008-06-05 A kind of method of evolution of quantitative analyzing RNA structure steadiness

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2008101115100A CN100559381C (en) 2008-06-05 2008-06-05 A kind of method of evolution of quantitative analyzing RNA structure steadiness

Publications (2)

Publication Number Publication Date
CN101281561A true CN101281561A (en) 2008-10-08
CN100559381C CN100559381C (en) 2009-11-11

Family

ID=40014028

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2008101115100A Expired - Fee Related CN100559381C (en) 2008-06-05 2008-06-05 A kind of method of evolution of quantitative analyzing RNA structure steadiness

Country Status (1)

Country Link
CN (1) CN100559381C (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103106351A (en) * 2013-02-28 2013-05-15 重庆科技学院 Question mark dependent type leptospira replicator loss ratio analysis method
CN105528532A (en) * 2014-09-30 2016-04-27 深圳华大基因科技有限公司 A feature analysis method for RNA editing sites
CN116864001A (en) * 2023-09-04 2023-10-10 深圳市前海高新国际医疗管理有限公司 Animal model RNA expression quantitative analysis system and method based on AI

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006170670A (en) * 2004-12-13 2006-06-29 Sony Corp Standardization method for gene expression level, program and system

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103106351A (en) * 2013-02-28 2013-05-15 重庆科技学院 Question mark dependent type leptospira replicator loss ratio analysis method
CN103106351B (en) * 2013-02-28 2017-08-04 重庆科技学院 A kind of Leptospira interrogans serovar Lai replicator loss late analysis method
CN105528532A (en) * 2014-09-30 2016-04-27 深圳华大基因科技有限公司 A feature analysis method for RNA editing sites
CN105528532B (en) * 2014-09-30 2019-08-16 深圳华大基因科技有限公司 A kind of characteristic analysis method in rna editing site
CN116864001A (en) * 2023-09-04 2023-10-10 深圳市前海高新国际医疗管理有限公司 Animal model RNA expression quantitative analysis system and method based on AI
CN116864001B (en) * 2023-09-04 2023-12-26 深圳市前海高新国际医疗管理有限公司 Animal model RNA expression quantitative analysis system and method based on AI

Also Published As

Publication number Publication date
CN100559381C (en) 2009-11-11

Similar Documents

Publication Publication Date Title
Chou et al. A comparative study of SVDquartets and other coalescent-based species tree estimation methods
Lorenz et al. Predicting RNA secondary structures from sequence and probing data
Aviran et al. Modeling and automation of sequencing-based characterization of RNA structure
Cui et al. Guitar: an R/bioconductor package for gene annotation guided transcriptomic analysis of RNA‐related genomic features
Browning et al. A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals
Gronau et al. Inference of natural selection from interspersed genomic elements based on polymorphism and divergence
Chan et al. An excess of risk-increasing low-frequency variants can be a signal of polygenic inheritance in complex diseases
Zhang et al. A review on recent computational methods for predicting noncoding RNAs
Weinberg et al. Faster genome annotation of non-coding RNA families without loss of accuracy
Belle et al. An investigation of the variation in the transition bias among various animal mitochondrial DNA
Matsumoto et al. Evaluation of ancestral sequence reconstruction methods to infer nonstationary patterns of nucleotide substitution
Sánchez-Gracia et al. Impact of deep coalescence on the reliability of species tree inference from different types of DNA markers in mammals
Song et al. Comparative chloroplast genomes of Sorghum species: sequence divergence and phylogenetic relationships
Peng et al. Forward-time simulation of realistic samples for genome-wide association studies
Li et al. A gene-based information gain method for detecting gene–gene interactions in case–control studies
Wang et al. A performance study of the impact of recombination on species tree analysis
Song et al. De novo post-SELEX optimization of a G-quadruplex DNA aptamer binding to marine toxin gonyautoxin 1/4
CN100559381C (en) A kind of method of evolution of quantitative analyzing RNA structure steadiness
Zhen et al. Greater strength of selection and higher proportion of beneficial amino acid changing mutations in humans compared with mice and Drosophila melanogaster
Chung et al. graph-GPA: a graphical model for prioritizing GWAS results and investigating pleiotropic architecture
Taub et al. Incorporating genotype uncertainties into the genotypic TDT for main effects and gene‐environment interactions
Audenaert et al. Impact of dissolved organic matter (DOM) on parameter sensitivity of a kinetic ozone decomposition model
Zhang et al. A multilocus linkage disequilibrium measure based on mutual information theory and its applications
Belov et al. Modeling of the DNA melting point dependence on various analysis factors
Ding et al. Insights into the coupling of duplication events and macroevolution from an age profile of animal transmembrane gene families

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20091111

Termination date: 20110605