Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes
- PMID: 16024819
- PMCID: PMC1182216
- DOI: 10.1101/gr.3715005
Abstract
We have conducted a comprehensive search for conserved elements in vertebrate genomes, using genome-wide multiple alignments of five vertebrate species (human, mouse, rat, chicken, and Fugu rubripes). Parallel searches have been performed with multiple alignments of four insect species (three species of Drosophila and Anopheles gambiae), two species of Caenorhabditis, and seven species of Saccharomyces. Conserved elements were identified with a computer program called phastCons, which is based on a two-state phylogenetic hidden Markov model (phylo-HMM). PhastCons works by fitting a phylo-HMM to the data by maximum likelihood, subject to constraints designed to calibrate the model across species groups, and then predicting conserved elements based on this model. The predicted elements cover roughly 3%-8% of the human genome (depending on the details of the calibration procedure) and substantially higher fractions of the more compact Drosophila melanogaster (37%-53%), Caenorhabditis elegans (18%-37%), and Saccharomyces cerevisiae (47%-68%) genomes. From yeasts to vertebrates, in order of increasing genome size and general biological complexity, increasing fractions of conserved bases are found to lie outside the exons of known protein-coding genes. In all groups, the most highly conserved elements (HCEs), by log-odds score, are hundreds or thousands of bases long. These elements share certain properties with ultraconserved elements, but they tend to be longer and less perfectly conserved, and they overlap genes of somewhat different functional categories. In vertebrates, HCEs are associated with the 3' UTRs of regulatory genes, stable gene deserts, and megabase-sized regions rich in moderately conserved noncoding sequences. Noncoding HCEs also show strong statistical evidence of an enrichment for RNA secondary structure.
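As a rough illustration of the two-state idea in the abstract, the Python sketch below labels alignment columns as conserved or non-conserved with Viterbi decoding and then reports the conserved runs as elements. It is not phastCons: the emission model is a toy assumption (a column counts as a match when all bases are identical), whereas phastCons scores each column under phylogenetic models fitted by maximum likelihood. The function names `viterbi_conserved` and `segments`, and all parameter values, are hypothetical choices made for the example.

```python
import math


def viterbi_conserved(columns, p_match_cons=0.95, p_match_neut=0.6, stay=0.9):
    """Label each alignment column as conserved (1) or non-conserved (0).

    Toy two-state HMM; all probabilities are illustrative assumptions,
    not values estimated by phastCons.
    """
    def log_emit(col, p_match):
        # Toy emission: a column "matches" if every species has the same base.
        identical = len(set(col)) == 1
        return math.log(p_match if identical else 1.0 - p_match)

    p_match = [p_match_neut, p_match_cons]  # index 0 = non-conserved, 1 = conserved
    log_stay, log_switch = math.log(stay), math.log(1.0 - stay)

    # score[s] = best log-probability of any state path ending in state s
    score = [math.log(0.5) + log_emit(columns[0], p_match[s]) for s in (0, 1)]
    back = []  # back[i][s] = best predecessor of state s at column i + 1
    for col in columns[1:]:
        new_score, ptr = [], []
        for s in (0, 1):
            cands = [score[r] + (log_stay if r == s else log_switch) for r in (0, 1)]
            best = 0 if cands[0] >= cands[1] else 1
            ptr.append(best)
            new_score.append(cands[best] + log_emit(col, p_match[s]))
        score = new_score
        back.append(ptr)

    # Trace back from the best final state to recover the full path.
    state = 0 if score[0] >= score[1] else 1
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


def segments(path):
    """Return half-open (start, end) intervals of consecutive conserved columns."""
    out, start = [], None
    for i, s in enumerate(path):
        if s == 1 and start is None:
            start = i
        elif s == 0 and start is not None:
            out.append((start, i))
            start = None
    if start is not None:
        out.append((start, len(path)))
    return out
```

Each column is passed as a string with one base per species, for example `"AAAAA"` for a five-species alignment column; `segments(viterbi_conserved(columns))` then gives the predicted element coordinates. The `stay` parameter plays the role of a smoothing knob: values closer to 1 favor fewer, longer elements.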
Web site references
- http://www.cse.ucsc.edu/~acs/conservation; Supplemental data for this study.
- http://genome.ucsc.edu; UC Santa Cruz Genome Browser.
- http://genome.ucsc.edu/cgi-bin/hgTables; UC Santa Cruz Table Browser.
- http://www.genetics.wustl.edu/saccharomycesgenomes/Contigs; download page for yeast sequence data, Washington University, St. Louis.
- http://www.broad.mit.edu/ftp/pub/annotation/fungi/comp_yeasts; download page for yeast sequence data, Broad Institute.