Automatic genome-wide reconstruction of phylogenetic gene trees
- PMID: 17646342
- DOI: 10.1093/bioinformatics/btm193
Automatic genome-wide reconstruction of phylogenetic gene trees
Abstract
Gene duplication and divergence is a major evolutionary force. Despite the growing number of fully sequenced genomes, methods for investigating these events on a genome-wide scale are still in their infancy. Here, we present SYNERGY, a novel and scalable algorithm that uses sequence similarity and a given species phylogeny to reconstruct the underlying evolutionary history of all genes in a large group of species. In doing so, SYNERGY resolves homology relations and accurately distinguishes orthologs from paralogs. We applied our approach to a set of nine fully sequenced fungal genomes spanning 150 million years, generating a genome-wide catalog of orthologous groups and corresponding gene trees. Our results are highly accurate when compared to a manually curated gold standard, and are robust to the quality of input according to a novel jackknife confidence scoring. The reconstructed gene trees provide a comprehensive view of gene evolution on a genomic scale. Our approach can be applied to any set of sequenced eukaryotic species with a known phylogeny, and opens the way to systematic studies of the evolution of individual genes, molecular systems and whole genomes.
Supplementary information: Supplementary data are available at Bioinformatics online.
Similar articles
-
Assessment of phylogenomic and orthology approaches for phylogenetic inference.Bioinformatics. 2007 Apr 1;23(7):815-24. doi: 10.1093/bioinformatics/btm015. Epub 2007 Jan 19. Bioinformatics. 2007. PMID: 17237036
-
Improving the specificity of high-throughput ortholog prediction.BMC Bioinformatics. 2006 May 28;7:270. doi: 10.1186/1471-2105-7-270. BMC Bioinformatics. 2006. PMID: 16729895 Free PMC article.
-
DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony.Bioinformatics. 2008 Jul 1;24(13):1540-1. doi: 10.1093/bioinformatics/btn230. Epub 2008 May 12. Bioinformatics. 2008. PMID: 18474508
-
Causes, consequences and solutions of phylogenetic incongruence.Brief Bioinform. 2015 May;16(3):536-48. doi: 10.1093/bib/bbu015. Epub 2014 May 27. Brief Bioinform. 2015. PMID: 24872401 Review.
-
Molecular Phylogenetics: Concepts for a Newcomer.Adv Biochem Eng Biotechnol. 2017;160:185-196. doi: 10.1007/10_2016_49. Adv Biochem Eng Biotechnol. 2017. PMID: 27783136 Review.
Cited by
-
A low-polynomial algorithm for assembling clusters of orthologous groups from intergenomic symmetric best matches.Bioinformatics. 2010 Jun 15;26(12):1481-7. doi: 10.1093/bioinformatics/btq229. Epub 2010 May 2. Bioinformatics. 2010. PMID: 20439257 Free PMC article.
-
Evolution of the Drosophila melanogaster Chromatin Landscape and Its Associated Proteins.Genome Biol Evol. 2019 Mar 1;11(3):660-677. doi: 10.1093/gbe/evz019. Genome Biol Evol. 2019. PMID: 30689829 Free PMC article.
-
Large-scale assignment of orthology: back to phylogenetics?Genome Biol. 2008 Oct 30;9(10):235. doi: 10.1186/gb-2008-9-10-235. Genome Biol. 2008. PMID: 18983710 Free PMC article. Review.
-
Conserved and Diverged Functions of the Calcineurin-Activated Prz1 Transcription Factor in Fission Yeast.Genetics. 2016 Apr;202(4):1365-75. doi: 10.1534/genetics.115.184218. Epub 2016 Feb 19. Genetics. 2016. PMID: 26896331 Free PMC article.
-
Positional orthology: putting genomic evolutionary relationships into context.Brief Bioinform. 2011 Sep;12(5):401-12. doi: 10.1093/bib/bbr040. Epub 2011 Jun 24. Brief Bioinform. 2011. PMID: 21705766 Free PMC article. Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources