Evaluation and improvements in the automatic alignment of protein sequences
- PMID: 3507699
- DOI: 10.1093/protein/1.2.89
Evaluation and improvements in the automatic alignment of protein sequences
Abstract
The accuracy of protein sequence alignment obtained by applying a commonly used global sequence comparison algorithm is assessed. Alignments based on the superposition of the three-dimensional structures are used as a standard for testing the automatic, sequence-based methods. Alignments obtained from the global comparison of five pairs of homologous protein sequences studied gave 54% agreement overall for residues in secondary structures. The inclusion of information about the secondary structure of one of the proteins in order to limit the number of gaps inserted in regions of secondary structure, improved this figure to 68%. A similarity score of greater than six standard deviation units suggests that an alignment which is greater than 75% correct within secondary structural regions can be obtained automatically for the pair of sequences.
Similar articles
-
From analysis of protein structural alignments toward a novel approach to align protein sequences.Proteins. 2004 Feb 15;54(3):569-82. doi: 10.1002/prot.10503. Proteins. 2004. PMID: 14748004
-
OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.BMC Bioinformatics. 2003 Oct 10;4:47. doi: 10.1186/1471-2105-4-47. BMC Bioinformatics. 2003. PMID: 14552658 Free PMC article.
-
SE: an algorithm for deriving sequence alignment from a pair of superimposed structures.BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2105-10-S1-S4. BMC Bioinformatics. 2009. PMID: 19208141 Free PMC article.
-
Understanding structural relationships in proteins of unsolved three-dimensional structure.Proteins. 1990;7(2):99-111. doi: 10.1002/prot.340070202. Proteins. 1990. PMID: 2183216 Review.
-
The protein structure code: what is its present status?Comput Appl Biosci. 1991 Apr;7(2):133-42. doi: 10.1093/bioinformatics/7.2.133. Comput Appl Biosci. 1991. PMID: 2059837 Review.
Cited by
-
A non-local gap-penalty for profile alignment.Bull Math Biol. 1996 Jan;58(1):1-18. doi: 10.1007/BF02458279. Bull Math Biol. 1996. PMID: 8819751
-
Bridging the gaps in statistical models of protein alignment.Bioinformatics. 2022 Jun 24;38(Suppl 1):i229-i237. doi: 10.1093/bioinformatics/btac246. Bioinformatics. 2022. PMID: 35758809 Free PMC article.
-
Using structure to explore the sequence alignment space of remote homologs.PLoS Comput Biol. 2011 Oct;7(10):e1002175. doi: 10.1371/journal.pcbi.1002175. Epub 2011 Oct 6. PLoS Comput Biol. 2011. PMID: 21998567 Free PMC article.
-
A differential geometric treatment of protein structure comparison.Bull Math Biol. 1994 Sep;56(5):923-43. doi: 10.1007/BF02458274. Bull Math Biol. 1994. PMID: 7920269
-
Characterization of the DNA polymerase gene of human herpesvirus 6.J Virol. 1991 Sep;65(9):4670-80. doi: 10.1128/JVI.65.9.4670-4680.1991. J Virol. 1991. PMID: 1651403 Free PMC article.