Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Aug;15(8):1153-60.
doi: 10.1101/gr.3567505.

Estimating the tempo and mode of gene family evolution from comparative genomic data

Affiliations

Estimating the tempo and mode of gene family evolution from comparative genomic data

Matthew W Hahn et al. Genome Res. 2005 Aug.

Abstract

Comparison of whole genomes has revealed that changes in the size of gene families among organisms is quite common. However, there are as yet no models of gene family evolution that make it possible to estimate ancestral states or to infer upon which lineages gene families have contracted or expanded. In addition, large differences in family size have generally been attributed to the effects of natural selection, without a strong statistical basis for these conclusions. Here we use a model of stochastic birth and death for gene family evolution and show that it can be efficiently applied to multispecies genome comparisons. This model takes into account the lengths of branches on phylogenetic trees, as well as duplication and deletion rates, and hence provides expectations for divergence in gene family size among lineages. The model offers both the opportunity to identify large-scale patterns in genome evolution and the ability to make stronger inferences regarding the role of natural selection in gene family expansion or contraction. We apply our method to data from the genomes of five yeast species to show its applicability.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Here, we visually explain the “large family bias” problem. The solid line shows the likelihood of gene families with sizes (k(k(k(kk)))) as a function of k, for values of k from 1 to 50. The dashed line shows the average likelihood of gene families evolved from a common ancestor with family size equal to k. The average is computed over 100 random samples for each value of k. Clearly, the likelihoods for large gene families are consistently and significantly smaller.
Figure 2.
Figure 2.
The phylogenetic tree. Branch lengths t are given in millions of years. The branch numbers used in this study are shown in circles.

Similar articles

Cited by

References

    1. Abril, J.F., Agarwal, P., Alexandersson, M., Antonarakis, S.E., Baertsch, R., Berry, E., Birney, E., Bork, P., Bray, N., Brent, M.R., et al. 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420: 520–562. - PubMed
    1. Bailey, N. 1964. The elements of stochastic processes. John Wiley & Sons, Inc., New York.
    1. Berger, R.L. and Boos, D.D. 1994. P values maximized over a confidence set for the nuisance parameter. J. Amer. Statist. Assoc. 89: 1012–1016.
    1. Cliften, P., Sudarsanam, P., Desikan, A., Fulton, L., Fulton, B., Majors, J., Waterston, R., Cohen, B.A., and Johnston, M. 2003. Finding functional features in Saccharomyces genomes by phylogenetic footprinting. Science 301: 71–76. - PubMed
    1. Copley, R., Goodstadt, L., and Ponting, C. 2003. Eukaryotic domain evolution inferred from genome comparisons. Curr. Opin. Genet. Dev. 13: 623–628. - PubMed

Publication types

LinkOut - more resources