Exhaustive search for over-represented DNA sequence motifs with CisFinder
- PMID: 19740934
- PMCID: PMC2762409
- DOI: 10.1093/dnares/dsp014
Exhaustive search for over-represented DNA sequence motifs with CisFinder
Abstract
We present CisFinder software, which generates a comprehensive list of motifs enriched in a set of DNA sequences and describes them with position frequency matrices (PFMs). A new algorithm was designed to estimate PFMs directly from counts of n-mer words with and without gaps; then PFMs are extended over gaps and flanking regions and clustered to generate non-redundant sets of motifs. The algorithm successfully identified binding motifs for 12 transcription factors (TFs) in embryonic stem cells based on published chromatin immunoprecipitation sequencing data. Furthermore, CisFinder successfully identified alternative binding motifs of TFs (e.g. POU5F1, ESRRB, and CTCF) and motifs for known and unknown co-factors of genes associated with the pluripotent state of ES cells. CisFinder also showed robust performance in the identification of motifs that were only slightly enriched in a set of DNA sequences.
Figures
Similar articles
-
A new exhaustive method and strategy for finding motifs in ChIP-enriched regions.PLoS One. 2014 Jan 24;9(1):e86044. doi: 10.1371/journal.pone.0086044. eCollection 2014. PLoS One. 2014. PMID: 24475069 Free PMC article.
-
FISim: a new similarity measure between transcription factor binding sites based on the fuzzy integral.BMC Bioinformatics. 2009 Jul 20;10:224. doi: 10.1186/1471-2105-10-224. BMC Bioinformatics. 2009. PMID: 19615102 Free PMC article.
-
Identification of Predictive Cis-Regulatory Elements Using a Discriminative Objective Function and a Dynamic Search Space.PLoS One. 2015 Oct 14;10(10):e0140557. doi: 10.1371/journal.pone.0140557. eCollection 2015. PLoS One. 2015. PMID: 26465884 Free PMC article.
-
Simultaneously learning DNA motif along with its position and sequence rank preferences through expectation maximization algorithm.J Comput Biol. 2013 Mar;20(3):237-48. doi: 10.1089/cmb.2012.0233. J Comput Biol. 2013. PMID: 23461573
-
An algorithmic perspective of de novo cis-regulatory motif finding based on ChIP-seq data.Brief Bioinform. 2018 Sep 28;19(5):1069-1081. doi: 10.1093/bib/bbx026. Brief Bioinform. 2018. PMID: 28334268 Review.
Cited by
-
Selective influence of Sox2 on POU transcription factor binding in embryonic and neural stem cells.EMBO Rep. 2015 Sep;16(9):1177-91. doi: 10.15252/embr.201540467. Epub 2015 Aug 11. EMBO Rep. 2015. PMID: 26265007 Free PMC article.
-
A genome-wide RNAi screen reveals determinants of human embryonic stem cell identity.Nature. 2010 Nov 11;468(7321):316-20. doi: 10.1038/nature09531. Epub 2010 Oct 17. Nature. 2010. PMID: 20953172
-
Targeting Lin28 axis enhances glypican-3-CAR T cell efficacy against hepatic tumor initiating cell population.Mol Ther. 2023 Mar 1;31(3):715-728. doi: 10.1016/j.ymthe.2023.01.002. Epub 2023 Jan 6. Mol Ther. 2023. PMID: 36609146 Free PMC article.
-
Discovery of CTCF-sensitive Cis-spliced fusion RNAs between adjacent genes in human prostate cells.PLoS Genet. 2015 Feb 6;11(2):e1005001. doi: 10.1371/journal.pgen.1005001. eCollection 2015 Feb. PLoS Genet. 2015. PMID: 25658338 Free PMC article.
-
PPARG binding landscapes in macrophages suggest a genome-wide contribution of PU.1 to divergent PPARG binding in human and mouse.PLoS One. 2012;7(10):e48102. doi: 10.1371/journal.pone.0048102. Epub 2012 Oct 31. PLoS One. 2012. PMID: 23118933 Free PMC article.
References
-
- Stoltenburg R., Reinemann C., Strehlitz B. SELEX–a (r)evolutionary method to generate high-affinity nucleic acid ligands. Biomol. Eng. 2007;24:381–403. - PubMed
-
- Barski A., Cuddapah S., Cui K., et al. High-resolution profiling of histone methylations in the human genome. Cell. 2007;129:823–37. - PubMed
-
- Johnson D.S., Mortazavi A., Myers R.M., Wold B. Genome-wide mapping of in vivo protein–DNA interactions. Science. 2007;316:1497–502. - PubMed
-
- Robertson G., Hirst M., Bainbridge M., et al. Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nat. Methods. 2007;4:651–7. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous