Using supermatrices for phylogenetic inquiry: an example using the sedges
- PMID: 23103590
- DOI: 10.1093/sysbio/sys088
Using supermatrices for phylogenetic inquiry: an example using the sedges
Abstract
In this article, we use supermatrix data-mining methods to reconstruct a large, highly inclusive phylogeny of Cyperaceae from nucleotide data available on GenBank. We explore the properties of these trees and their utility for phylogenetic inference, and show that even the highly incomplete alignments characteristic of supermatrix approaches may yield very good estimates of phylogeny. We present a novel pipeline for filtering sparse alignments to improve their phylogenetic utility by maximizing the partial decisiveness of the matrices themselves through a technique we call "phylogenetic scaffolding," and we present a new method of scoring tip instability (i.e. "rogue taxa") based on the I statistic implemented in the software Mesquite. The modified statistic, which we call I(S), is somewhat more straightforward to interpret than similar statistics, and our implementation of it may be applied to large sets of large trees. The largest sedge trees presented here contain more than 1500 tips (about one quarter of all sedge species) and are based on multigene alignments with more than 20 000 sites and more than 90% missing data. These trees match well with previously supported phylogenetic hypotheses, but have lower overall support values and less resolution than more heavily filtered trees. Our best-resolved trees are characterized by stronger support values than any previously published sedge phylogenies, and show some relationships that are incongruous with previous studies. Overall, we show that supermatrix methods offer powerful means of pursuing phylogenetic study and these tools have high potential value for many systematic biologists.
Similar articles
-
Sparse supermatrices for phylogenetic inference: taxonomy, alignment, rogue taxa, and the phylogeny of living turtles.Syst Biol. 2010 Jan;59(1):42-58. doi: 10.1093/sysbio/syp075. Epub 2009 Nov 11. Syst Biol. 2010. PMID: 20525619
-
The use and validity of composite taxa in phylogenetic analysis.Syst Biol. 2009 Dec;58(6):560-72. doi: 10.1093/sysbio/syp056. Epub 2009 Sep 21. Syst Biol. 2009. PMID: 20525609
-
Phylogenetic supermatrix analysis of GenBank sequences from 2228 papilionoid legumes.Syst Biol. 2006 Oct;55(5):818-36. doi: 10.1080/10635150600999150. Syst Biol. 2006. PMID: 17060202
-
The supermatrix approach to systematics.Trends Ecol Evol. 2007 Jan;22(1):34-41. doi: 10.1016/j.tree.2006.10.002. Epub 2006 Oct 13. Trends Ecol Evol. 2007. PMID: 17046100 Review.
-
[A bird's eye view of the algorithms and software packages for reconstructing phylogenetic trees].Dongwuxue Yanjiu. 2013 Dec;34(6):640-50. Dongwuxue Yanjiu. 2013. PMID: 24415699 Review. Chinese.
Cited by
-
Some limitations of public sequence data for phylogenetic inference (in plants).PLoS One. 2014 Jul 7;9(7):e98986. doi: 10.1371/journal.pone.0098986. eCollection 2014. PLoS One. 2014. PMID: 24999823 Free PMC article.
-
Diversification of flowering plants in space and time.Nat Commun. 2023 Nov 22;14(1):7609. doi: 10.1038/s41467-023-43396-8. Nat Commun. 2023. PMID: 37993449 Free PMC article.
-
Terrace Aware Data Structure for Phylogenomic Inference from Supermatrices.Syst Biol. 2016 Nov;65(6):997-1008. doi: 10.1093/sysbio/syw037. Epub 2016 Apr 26. Syst Biol. 2016. PMID: 27121966 Free PMC article.
-
Tackling Rapid Radiations With Targeted Sequencing.Front Plant Sci. 2020 Jan 9;10:1655. doi: 10.3389/fpls.2019.01655. eCollection 2019. Front Plant Sci. 2020. PMID: 31998342 Free PMC article.
-
UFBoot2: Improving the Ultrafast Bootstrap Approximation.Mol Biol Evol. 2018 Feb 1;35(2):518-522. doi: 10.1093/molbev/msx281. Mol Biol Evol. 2018. PMID: 29077904 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous