Finding weak motifs in DNA sequences
- PMID: 11928479
- DOI: 10.1142/9789812799623_0022
Finding weak motifs in DNA sequences
Abstract
Recognition of regulatory sites in unaligned DNA sequences is an old and well-studied problem in computational molecular biology. Recently, large-scale expression studies and comparative genomics brought this problem into a spotlight by generating a large number of samples with unknown regulatory signals. Here we develop algorithms for recognition of signals in corrupted samples (where only a fraction of sequences contain sites) with biased nucleotide composition. We further benchmark these and other algorithms on several bacterial and archaeal sites in a setting specifically designed to imitate the situations arising in comparative genomics studies.
Similar articles
-
Parsing regulatory DNA: general tasks, techniques, and the PhyloGibbs approach.J Biosci. 2007 Aug;32(5):863-70. doi: 10.1007/s12038-007-0086-0. J Biosci. 2007. PMID: 17914228 Review.
-
Finding composite regulatory patterns in DNA sequences.Bioinformatics. 2002;18 Suppl 1:S354-63. doi: 10.1093/bioinformatics/18.suppl_1.s354. Bioinformatics. 2002. PMID: 12169566
-
Identification of promoter regions and regulatory sites.Methods Mol Biol. 2010;674:57-83. doi: 10.1007/978-1-60761-854-6_5. Methods Mol Biol. 2010. PMID: 20827586
-
An improved heuristic algorithm for finding motif signals in DNA sequences.IEEE/ACM Trans Comput Biol Bioinform. 2011 Jul-Aug;8(4):959-75. doi: 10.1109/TCBB.2010.92. IEEE/ACM Trans Comput Biol Bioinform. 2011. PMID: 20855921
-
Bioinformatics for the 'bench biologist': how to find regulatory regions in genomic DNA.Nat Immunol. 2004 Aug;5(8):768-74. doi: 10.1038/ni0804-768. Nat Immunol. 2004. PMID: 15282556 Review.
Cited by
-
More robust detection of motifs in coexpressed genes by using phylogenetic information.BMC Bioinformatics. 2006 Mar 20;7:160. doi: 10.1186/1471-2105-7-160. BMC Bioinformatics. 2006. PMID: 16549017 Free PMC article.
-
Motif discovery and transcription factor binding sites before and after the next-generation sequencing era.Brief Bioinform. 2013 Mar;14(2):225-37. doi: 10.1093/bib/bbs016. Epub 2012 Apr 19. Brief Bioinform. 2013. PMID: 22517426 Free PMC article.
-
Quantitative evaluation of protein-DNA interactions using an optimized knowledge-based potential.Nucleic Acids Res. 2005 Jan 26;33(2):546-58. doi: 10.1093/nar/gki204. Print 2005. Nucleic Acids Res. 2005. PMID: 15673715 Free PMC article.
-
Assessment of composite motif discovery methods.BMC Bioinformatics. 2008 Feb 26;9:123. doi: 10.1186/1471-2105-9-123. BMC Bioinformatics. 2008. PMID: 18302777 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources