RNA secondary structure prediction based on free energy and phylogenetic analysis
- PMID: 10369773
- DOI: 10.1006/jmbi.1999.2801
RNA secondary structure prediction based on free energy and phylogenetic analysis
Abstract
We describe a computational method for the prediction of RNA secondary structure that uses a combination of free energy and comparative sequence analysis strategies. Using a homology-based sequence alignment as a starting point, all favorable pairings with respect to the Turner energy function are identified. Each potentially paired region within a multiple sequence alignment is scored using a function that combines both predicted free energy and sequence covariation with optimized weightings. High scoring regions are ranked and sequentially incorporated to define a growing secondary structure. Using a single set of optimized parameters, it is possible to accurately predict the foldings of several test RNAs defined previously by extensive phylogenetic and experimental data (including tRNA, 5 S rRNA, SRP RNA, tmRNA, and 16 S rRNA). The algorithm correctly predicts approximately 80% of the secondary structure. A range of parameters have been tested to define the minimal sequence information content required to accurately predict secondary structure and to assess the importance of individual terms in the prediction scheme. This analysis indicates that prediction accuracy most strongly depends upon covariational information and only weakly on the energetic terms. However, relatively few sequences prove sufficient to provide the covariational information required for an accurate prediction. Secondary structures can be accurately defined by alignments with as few as five sequences and predictions improve only moderately with the inclusion of additional sequences.
Copyright 1999 Academic Press.
Similar articles
-
Cofolga: a genetic algorithm for finding the common folding of two RNAs.Comput Biol Chem. 2005 Apr;29(2):111-9. doi: 10.1016/j.compbiolchem.2005.02.004. Comput Biol Chem. 2005. PMID: 15833439
-
Dynalign: an algorithm for finding the secondary structure common to two RNA sequences.J Mol Biol. 2002 Mar 22;317(2):191-203. doi: 10.1006/jmbi.2001.5351. J Mol Biol. 2002. PMID: 11902836
-
Predicting a set of minimal free energy RNA secondary structures common to two sequences.Bioinformatics. 2005 May 15;21(10):2246-53. doi: 10.1093/bioinformatics/bti349. Epub 2005 Feb 24. Bioinformatics. 2005. PMID: 15731207
-
From consensus structure prediction to RNA gene finding.Brief Funct Genomic Proteomic. 2009 Nov;8(6):461-71. doi: 10.1093/bfgp/elp043. Brief Funct Genomic Proteomic. 2009. PMID: 19833701 Review.
-
Revolutions in RNA secondary structure prediction.J Mol Biol. 2006 Jun 9;359(3):526-32. doi: 10.1016/j.jmb.2006.01.067. Epub 2006 Feb 6. J Mol Biol. 2006. PMID: 16500677 Review.
Cited by
-
Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction.BMC Bioinformatics. 2004 Jun 4;5:71. doi: 10.1186/1471-2105-5-71. BMC Bioinformatics. 2004. PMID: 15180907 Free PMC article.
-
The RNA encoding the microtubule-associated protein tau has extensive structure that affects its biology.PLoS One. 2019 Jul 10;14(7):e0219210. doi: 10.1371/journal.pone.0219210. eCollection 2019. PLoS One. 2019. PMID: 31291322 Free PMC article.
-
Thermodynamic and kinetic characterization of antisense oligodeoxynucleotide binding to a structured mRNA.Biophys J. 2002 Jan;82(1 Pt 1):366-77. doi: 10.1016/S0006-3495(02)75401-5. Biophys J. 2002. PMID: 11751323 Free PMC article.
-
Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints.BMC Bioinformatics. 2006 Sep 4;7:400. doi: 10.1186/1471-2105-7-400. BMC Bioinformatics. 2006. PMID: 16952317 Free PMC article.
-
Sequence comparison and secondary structure analysis of the 3' noncoding region of flavivirus genomes reveals multiple pseudoknots.RNA. 2001 Oct;7(10):1370-7. RNA. 2001. PMID: 11680841 Free PMC article.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources