Theoretical foundation of the balanced minimum evolution method of phylogenetic inference and its relationship to weighted least-squares tree fitting
- PMID: 14694080
- DOI: 10.1093/molbev/msh049
Abstract
Due to its speed, the distance approach remains the best hope for building phylogenies on very large sets of taxa. Recently (R. Desper and O. Gascuel, J. Comp. Biol. 9:687-705, 2002), we introduced a new "balanced" minimum evolution (BME) principle, based on a branch length estimation scheme of Y. Pauplin (J. Mol. Evol. 51:41-47, 2000). Initial simulations suggested that FASTME, our program implementing the BME principle, was more accurate than or equivalent to all other distance methods we tested, with a running time significantly shorter than that of Neighbor-Joining (NJ). This article further explores the properties of the BME principle, and it explains and illustrates its impressive topological accuracy. We prove that the BME principle is a special case of the weighted least-squares approach, with biologically meaningful variances of the distance estimates. We show that the BME principle is statistically consistent. We demonstrate that FASTME only produces trees with positive branch lengths, a feature that separates this approach from NJ (and related methods), which may produce trees whose branches have biologically meaningless negative lengths. Finally, we consider a large simulated data set, with 5,000 100-taxon trees generated by the Aldous beta-splitting distribution, encompassing a range of distributions from Yule-Harding to uniform, and using a covarion-like model of sequence evolution. FASTME produces trees faster than NJ, and much faster than WEIGHBOR and the weighted least-squares implementation of PAUP*. Moreover, FASTME trees are consistently more accurate at all settings, ranging from Yule-Harding to uniform distributions, and across all ranges of maximum pairwise divergence and departure from the molecular clock. Interestingly, the covarion parameter has little effect on the tree quality for any of the algorithms. FASTME is freely available on the web.
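To make the BME criterion concrete, the following minimal sketch evaluates Pauplin's balanced tree length for a fixed unrooted binary topology, l(T) = sum over leaf pairs i < j of 2^(1 - tau_ij) * d_ij, where tau_ij is the number of edges on the path between leaves i and j. It is an illustration only, not FASTME's implementation: the function names, the adjacency-dictionary tree encoding, and the four-taxon distances are all assumptions chosen for the example.

```python
from collections import deque
from itertools import combinations


def path_edge_counts(adjacency, leaves):
    """Return {(a, b): number of edges on the a-b path} via BFS from each leaf."""
    counts = {}
    for src in leaves:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            node = queue.popleft()
            for nxt in adjacency[node]:
                if nxt not in dist:
                    dist[nxt] = dist[node] + 1
                    queue.append(nxt)
        for dst in leaves:
            if dst != src:
                counts[(src, dst)] = dist[dst]
    return counts


def balanced_tree_length(adjacency, leaves, d):
    """Pauplin's balanced length: sum of 2**(1 - tau_ij) * d_ij over leaf pairs."""
    tau = path_edge_counts(adjacency, leaves)
    return sum(
        2.0 ** (1 - tau[(a, b)]) * d[frozenset((a, b))]
        for a, b in combinations(leaves, 2)
    )


# Hypothetical example: distances that are additive on ((a,b),(c,d))
# with every branch of length 1 (total tree length 5).
leaves = ["a", "b", "c", "d"]
d = {
    frozenset(pair): value
    for pair, value in {
        ("a", "b"): 2.0, ("c", "d"): 2.0,
        ("a", "c"): 3.0, ("a", "d"): 3.0,
        ("b", "c"): 3.0, ("b", "d"): 3.0,
    }.items()
}

# u and v are the two internal nodes of each quartet topology.
true_tree = {"a": ["u"], "b": ["u"], "c": ["v"], "d": ["v"],
             "u": ["a", "b", "v"], "v": ["c", "d", "u"]}
alt_tree = {"a": ["u"], "c": ["u"], "b": ["v"], "d": ["v"],
            "u": ["a", "c", "v"], "v": ["b", "d", "u"]}

print(balanced_tree_length(true_tree, leaves, d))  # 5.0
print(balanced_tree_length(alt_tree, leaves, d))   # 5.5
```

On these additive distances the generating topology attains the smaller balanced length, and that length equals the sum of its branch lengths. This is the behaviour the consistency result describes. A real BME search would compare many topologies rather than two.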
Similar articles
- Robustness of phylogenetic inference based on minimum evolution. Bull Math Biol. 2010 Oct;72(7):1820-39. doi: 10.1007/s11538-010-9510-y. Epub 2010 May 7. PMID: 20449671
- Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle. J Comput Biol. 2002;9(5):687-705. doi: 10.1089/106652702761034136. PMID: 12487758
- Evaluating the relationship between evolutionary divergence and phylogenetic accuracy in AFLP data sets. Mol Biol Evol. 2010 May;27(5):988-1000. doi: 10.1093/molbev/msp315. Epub 2009 Dec 21. PMID: 20026482
- Neighbor-joining revealed. Mol Biol Evol. 2006 Nov;23(11):1997-2000. doi: 10.1093/molbev/msl072. Epub 2006 Jul 28. PMID: 16877499 Review.
- Phylogenetic analysis in molecular evolutionary genetics. Annu Rev Genet. 1996;30:371-403. doi: 10.1146/annurev.genet.30.1.371. PMID: 8982459 Review.
Cited by
- The Arabidopsis thaliana ortholog of a purported maize cholinesterase gene encodes a GDSL-lipase. Plant Mol Biol. 2013 Apr;81(6):565-76. doi: 10.1007/s11103-013-0021-8. Epub 2013 Feb 22. PMID: 23430565 Free PMC article.
- Genome BLAST distance phylogenies inferred from whole plastid and whole mitochondrion genome sequences. BMC Bioinformatics. 2006 Jul 19;7:350. doi: 10.1186/1471-2105-7-350. PMID: 16854218 Free PMC article.
- Transcriptome-Based Identification of a Functional Fasciola hepatica Carboxylesterase B. Pathogens. 2021 Nov 10;10(11):1454. doi: 10.3390/pathogens10111454. PMID: 34832612 Free PMC article.
- Fast phylogenetic DNA barcoding. Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):3997-4002. doi: 10.1098/rstb.2008.0169. PMID: 18852104 Free PMC article.
- CRISPR-Cas13 Inhibitors Block RNA Editing in Bacteria and Mammalian Cells. Mol Cell. 2020 Jun 4;78(5):850-861.e5. doi: 10.1016/j.molcel.2020.03.033. Epub 2020 Apr 28. PMID: 32348779 Free PMC article.