A universal framework for regulatory element discovery across all genomes and data types
- PMID: 17964271
- PMCID: PMC2900317
- DOI: 10.1016/j.molcel.2007.09.027
A universal framework for regulatory element discovery across all genomes and data types
Abstract
Deciphering the noncoding regulatory genome has proved a formidable challenge. Despite the wealth of available gene expression data, there currently exists no broadly applicable method for characterizing the regulatory elements that shape the rich underlying dynamics. We present a general framework for detecting such regulatory DNA and RNA motifs that relies on directly assessing the mutual information between sequence and gene expression measurements. Our approach makes minimal assumptions about the background sequence model and the mechanisms by which elements affect gene expression. This provides a versatile motif discovery framework, across all data types and genomes, with exceptional sensitivity and near-zero false-positive rates. Applications from yeast to human uncover putative and established transcription-factor binding and miRNA target sites, revealing rich diversity in their spatial configurations, pervasive co-occurrences of DNA and RNA motifs, context-dependent selection for motif avoidance, and the strong impact of posttranscriptional processes on eukaryotic transcriptomes.
Figures
Similar articles
-
MoD Tools: regulatory motif discovery in nucleotide sequences from co-regulated or homologous genes.Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W566-70. doi: 10.1093/nar/gkl285. Nucleic Acids Res. 2006. PMID: 16845071 Free PMC article.
-
High-resolution DNA-binding specificity analysis of yeast transcription factors.Genome Res. 2009 Apr;19(4):556-66. doi: 10.1101/gr.090233.108. Epub 2009 Jan 21. Genome Res. 2009. PMID: 19158363 Free PMC article.
-
Identification of the cis-acting DNA sequence elements regulating the transcription of the Saccharomyces cerevisiae gene encoding TBP, the TATA box binding protein.J Biol Chem. 1994 Nov 11;269(45):28335-46. J Biol Chem. 1994. PMID: 7961772
-
Identification of cis-regulatory elements in gene co-expression networks using A-GLAM.Methods Mol Biol. 2009;541:1-22. doi: 10.1007/978-1-59745-243-4_1. Methods Mol Biol. 2009. PMID: 19381547 Free PMC article. Review.
-
Transcriptional networks: reverse-engineering gene regulation on a global scale.Curr Opin Microbiol. 2004 Dec;7(6):638-46. doi: 10.1016/j.mib.2004.10.009. Curr Opin Microbiol. 2004. PMID: 15556037 Review.
Cited by
-
The contribution of RNA decay quantitative trait loci to inter-individual variation in steady-state gene expression levels.PLoS Genet. 2012;8(10):e1003000. doi: 10.1371/journal.pgen.1003000. Epub 2012 Oct 11. PLoS Genet. 2012. PMID: 23071454 Free PMC article.
-
High hydrostatic pressure activates transcription factors involved in Saccharomyces cerevisiae stress tolerance.Curr Pharm Biotechnol. 2012 Dec;13(15):2712-20. doi: 10.2174/138920112804724891. Curr Pharm Biotechnol. 2012. PMID: 23072392 Free PMC article.
-
Identification of long regulatory elements in the genome of Plasmodium falciparum and other eukaryotes.PLoS Comput Biol. 2021 Apr 16;17(4):e1008909. doi: 10.1371/journal.pcbi.1008909. eCollection 2021 Apr. PLoS Comput Biol. 2021. PMID: 33861755 Free PMC article.
-
Identification of germ cell-specific genes in mammalian meiotic prophase.BMC Bioinformatics. 2013 Feb 27;14:72. doi: 10.1186/1471-2105-14-72. BMC Bioinformatics. 2013. PMID: 23445120 Free PMC article.
-
Endogenous tRNA-Derived Fragments Suppress Breast Cancer Progression via YBX1 Displacement.Cell. 2015 May 7;161(4):790-802. doi: 10.1016/j.cell.2015.02.053. Cell. 2015. PMID: 25957686 Free PMC article.
References
-
- Beer M, Tavazoie S. Predicting gene expression from sequence. Cell. 2004;117:185–198. - PubMed
-
- Bolognese F, Wasner M, Dohna CL, Gurtner A, Ronchi A, Muller H, Manni I, Mossner J, Piaggio G, Mantovani R, Engeland K. The cyclin B2 promoter depends on NF-Y, a trimer whose CCAAT-binding activity is cell-cycle regulated. Oncogene. 1999;18:1845–1853. - PubMed
-
- Bucher P. Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. J Mol Biol. 1990;212:563–578. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases