Noncoding sequences conserved in a limited number of mammals in the SIM2 interval are frequently functional
- PMID: 14962988
- PMCID: PMC353216
- DOI: 10.1101/gr.1961204
Noncoding sequences conserved in a limited number of mammals in the SIM2 interval are frequently functional
Abstract
Cross-species DNA sequence comparison is a fundamental method for identifying biologically important elements, because functional sequences are evolutionarily conserved, wheres nonfunctional sequences drift. A recent genome-wide comparison of human and mouse DNA discovered over 200,000 conserved noncoding sequences with unknown function. Multispecies DNA comparison has been proposed as a method to prioritize these conserved noncoding sequences for functional analysis based on the hypothesis that elements present in many species are more likely to be functional than elements present in limited numbers of species. Here, we perform a comparative analysis of the single-minded 2 (SIM2) gene interval on human chromosome 21 with horse, cow, pig, dog, cat, and mouse DNA. We classify conserved sequences based on the number of mammals in which they are present, and experimentally test sequences in each class for function. As hypothesized, conserved sequences present in many mammals are frequently functional. Additionally, we demonstrate that sequences conserved in a limited number of mammals are also frequently functional. Examination of genomic deletions in chimpanzee and rhesus macaque DNA showed that several putatively functional conserved noncoding human sequences were absent in these primates. These findings suggest that functional conserved noncoding human sequences can be missing in other mammals, even closely related primate species.
Figures
Similar articles
-
Comparison of human chromosome 21 conserved nongenic sequences (CNGs) with the mouse and dog genomes shows that their selective constraint is independent of their genic environment.Genome Res. 2004 May;14(5):852-9. doi: 10.1101/gr.1934904. Epub 2004 Apr 12. Genome Res. 2004. PMID: 15078857 Free PMC article.
-
Parallel construction of orthologous sequence-ready clone contig maps in multiple species.Genome Res. 2002 Aug;12(8):1277-85. doi: 10.1101/gr.283202. Genome Res. 2002. PMID: 12176935 Free PMC article.
-
Accelerated evolution of conserved noncoding sequences in humans.Science. 2006 Nov 3;314(5800):786. doi: 10.1126/science.1130738. Science. 2006. PMID: 17082449
-
Conserved non-genic sequences - an unexpected feature of mammalian genomes.Nat Rev Genet. 2005 Feb;6(2):151-7. doi: 10.1038/nrg1527. Nat Rev Genet. 2005. PMID: 15716910 Review.
-
Bioinformatics for the 'bench biologist': how to find regulatory regions in genomic DNA.Nat Immunol. 2004 Aug;5(8):768-74. doi: 10.1038/ni0804-768. Nat Immunol. 2004. PMID: 15282556 Review.
Cited by
-
Molecular hyperdiversity and evolution in very large populations.Mol Ecol. 2013 Apr;22(8):2074-95. doi: 10.1111/mec.12281. Epub 2013 Mar 18. Mol Ecol. 2013. PMID: 23506466 Free PMC article.
-
Identification and characterization of new long conserved noncoding sequences in vertebrates.Mamm Genome. 2008 Oct-Dec;19(10-12):703-12. doi: 10.1007/s00335-008-9152-7. Epub 2008 Nov 18. Mamm Genome. 2008. PMID: 19015917
-
Transcriptional enhancement by GATA1-occupied DNA segments is strongly associated with evolutionary constraint on the binding site motif.Genome Res. 2008 Dec;18(12):1896-905. doi: 10.1101/gr.083089.108. Epub 2008 Sep 25. Genome Res. 2008. PMID: 18818370 Free PMC article.
-
Functional constraint and divergence in the G protein family in Caenorhabditis elegans and Caenorhabditis briggsae.Mol Genet Genomics. 2005 Jun;273(4):299-310. doi: 10.1007/s00438-004-1105-6. Epub 2005 Apr 27. Mol Genet Genomics. 2005. PMID: 15856303
-
Long-range comparison of human and mouse Sprr loci to identify conserved noncoding sequences involved in coordinate regulation.Genome Res. 2004 Dec;14(12):2430-8. doi: 10.1101/gr.2709404. Genome Res. 2004. PMID: 15574822 Free PMC article.
References
-
- Boffelli, D., McAuliffe, J., Ovcharenko, D., Lewis, K.D., Ovcharenko, I., Pachter, L., and Rubin, E.M. 2003. Phylogenetic shadowing of primate sequences to find functional regions of the human genome. Science 299: 1391–1394. - PubMed
-
- Dermitzakis, E.T., Reymond, A., Lyle, R., Scamuffa, N., Ucla, C., Deutsch, S., Stevenson, B.J., Flegel, V., Bucher, P., Jongeneel, C.V., et al. 2002. Numerous potentially functional but non-genic conserved sequences on human chromosome 21. Nature 420: 578–582. - PubMed
-
- Ema, M., Ikegami, S., Hosoya, T., Mimura, J., Ohtani, H., Nakao, K., Inokuchi, K., Katsuki, M., and Fujii-Kuriyama, Y. 1999. Mild impairment of learning and memory in mice overexpressing the mSim2 gene located on chromosome 16: An animal model of Down's syndrome. Hum. Mol. Genet. 8: 1409–1415. - PubMed
-
- Fahrenkrug, S.C., Rohrer, G.A., Freking, B.A., Smith, T.P., Osoegawa, K., Shu, C.L., Catanese, J.J., and de Jong, P.J. 2001. A porcine BAC library with tenfold genome coverage: A resource for physical and genetic map integration. Mamm. Genome 12: 472–474. - PubMed
WEB SITE REFERENCES
-
- http://bio.cse.psu.edu/genome/hummus/; Whole Genome Human/Mouse Homology Web site.
-
- http://bacpac.chori.org/; BACPAC Resources Center Home Page (Children's Hospital Oakland Research Institute).
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous