Simultaneous sequence alignment and tree construction using hidden Markov models
- PMID: 12603027
Simultaneous sequence alignment and tree construction using hidden Markov models
Abstract
We present a new algorithm (SATCHMO) that simultaneously estimates a tree and generates a set of multiple sequence alignments given a set of protein sequences. Alignments are constructed for each node in the tree. These alignments predict the structurally conserved elements of the sequences in a subtree and are therefore of different lengths, and represent different amino acid preferences, at different nodes. Hidden Markov Models (HMMs) are also generated for each node and are used to determine branching order, to align sequences and to predict structurally alignable regions. In experiments on the BAliBASE benchmark alignment database, SATCHMO is shown to perform comparably to ClustalW and the UCSC SAM HMM software. Results using SATCHMO to identify protein domains are demonstrated on potassium channels, with implications for the mechanism by which tumor necrosis factor alpha affects potassium current.
Similar articles
-
SATCHMO: sequence alignment and tree construction using hidden Markov models.Bioinformatics. 2003 Jul 22;19(11):1404-11. doi: 10.1093/bioinformatics/btg158. Bioinformatics. 2003. PMID: 12874053
-
SATCHMO-JS: a webserver for simultaneous protein multiple sequence alignment and phylogenetic tree construction.Nucleic Acids Res. 2010 Jul;38(Web Server issue):W29-34. doi: 10.1093/nar/gkq298. Epub 2010 Apr 29. Nucleic Acids Res. 2010. PMID: 20430824 Free PMC article.
-
OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.BMC Bioinformatics. 2003 Oct 10;4:47. doi: 10.1186/1471-2105-4-47. BMC Bioinformatics. 2003. PMID: 14552658 Free PMC article.
-
Profile hidden Markov models.Bioinformatics. 1998;14(9):755-63. doi: 10.1093/bioinformatics/14.9.755. Bioinformatics. 1998. PMID: 9918945 Review.
-
Hidden Markov Models for prediction of protein features.Methods Mol Biol. 2008;413:173-98. doi: 10.1007/978-1-59745-574-9_7. Methods Mol Biol. 2008. PMID: 18075166 Review.
Cited by
-
Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments.Proteins. 2005 Feb 1;58(2):321-8. doi: 10.1002/prot.20308. Proteins. 2005. PMID: 15523666 Free PMC article.
-
Alignment of protein sequences by their profiles.Protein Sci. 2004 Apr;13(4):1071-87. doi: 10.1110/ps.03379804. Protein Sci. 2004. PMID: 15044736 Free PMC article.
-
Comparative protein structure modeling by iterative alignment, model building and model assessment.Nucleic Acids Res. 2003 Jul 15;31(14):3982-92. doi: 10.1093/nar/gkg460. Nucleic Acids Res. 2003. PMID: 12853614 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources