SLiMDisc: short, linear motif discovery, correcting for common evolutionary descent
- PMID: 16855291
- PMCID: PMC1524906
- DOI: 10.1093/nar/gkl486
Abstract
Many important interactions of proteins are facilitated by short, linear motifs (SLiMs) within a protein's primary sequence. Our aim was to establish robust methods for discovering putative functional motifs. The strongest evidence for such motifs is obtained when the same motifs occur in unrelated proteins, evolving by convergence. In practice, searches for such motifs are often swamped by motifs shared in related proteins that are identical by descent. Predictions of motifs among sets of biologically related proteins, including those both with and without detectable similarity, were made using the TEIRESIAS algorithm. The number of motif occurrences arising through common evolutionary descent was normalized based on treatment of BLAST local alignments. Motifs were ranked according to a score derived from the product of the normalized number of occurrences and the information content. The method was shown to significantly outperform methods that do not discount evolutionary relatedness, when applied to known SLiMs from a subset of the eukaryotic linear motif (ELM) database. An implementation of Multiple Spanning Tree weighting outperformed two other weighting schemes, in a variety of settings.
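The ranking idea in the abstract (score = normalized number of occurrences × information content) can be illustrated with a minimal Python sketch. This is not the SLiMDisc implementation. Several simplifications are assumed here for illustration: related proteins are pre-assigned to families by a hypothetical `family_of` mapping (standing in for the BLAST-based treatment), each family counts once toward support (a cruder stand-in for the spanning-tree weighting mentioned above), and information content assumes a uniform background over 20 amino acids, with `.` wildcards contributing nothing. All function names and the toy sequences are made up.

```python
import math
import re

def information_content(motif: str) -> float:
    """Bits contributed by fixed positions of a motif.

    Assumption: uniform 20-residue background, so each fixed position
    adds log2(20) bits and each '.' wildcard adds 0. Only fixed residues
    and '.' are supported in this sketch (no bracketed classes).
    """
    return sum(math.log2(20) for ch in motif if ch != ".")

def normalized_support(motif: str, sequences: dict, family_of: dict) -> int:
    """Count families (not proteins) that contain the motif.

    Occurrences in related proteins collapse into one, so a motif that
    is shared only by descent within one family gets a support of 1.
    """
    pattern = re.compile(motif)
    hit_families = {
        family_of[name]
        for name, seq in sequences.items()
        if pattern.search(seq)
    }
    return len(hit_families)

def score(motif: str, sequences: dict, family_of: dict) -> float:
    """Product of normalized support and information content."""
    return normalized_support(motif, sequences, family_of) * information_content(motif)

# Toy data (hypothetical): a1 and a2 are close relatives, b1 and c1 are unrelated.
sequences = {
    "a1": "MKTAYKDELQ",
    "a2": "MKTAYKDELR",
    "b1": "GGSPLKDELW",
    "c1": "MSTNNPQRAA",
}
family_of = {"a1": "famA", "a2": "famA", "b1": "famB", "c1": "famC"}

# Rank candidate motifs by score, highest first.
candidates = ["KDEL", "K.EL", "MKTA"]
ranked = sorted(candidates, key=lambda m: score(m, sequences, family_of), reverse=True)
print(ranked)
```

In this toy set, `KDEL` occurs in three proteins but only two families, so its support is 2 rather than 3, while `MKTA` occurs only within one family and is down-weighted to a support of 1 despite having two raw occurrences.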