SAGA: sequence alignment by genetic algorithm
- PMID: 8628686
- PMCID: PMC145823
- DOI: 10.1093/nar/24.8.1515
Abstract
We describe a new approach to multiple sequence alignment using genetic algorithms and an associated software package called SAGA. The method evolves a population of alignments in a quasi-evolutionary manner, gradually improving the fitness of the population as measured by an objective function that scores multiple alignment quality. SAGA uses an automatic scheduling scheme to control the usage of 22 different operators for combining alignments or mutating them between generations. When used to optimise the well-known sums-of-pairs objective function, SAGA performs better than some of the widely used alternative packages. This is seen both in its ability to achieve an optimal solution and in the accuracy of its alignments when compared with reference alignments based on sequences of known tertiary structure. The general attraction of the approach is the ability to optimise any objective function that one can invent.
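To make the idea of a fitness measure concrete, the following is a minimal sketch of a sum-of-pairs style score for a candidate alignment. It is an illustration only, not SAGA's implementation: the function name `sum_of_pairs`, the flat match/mismatch/gap values, and the choice to ignore gap-gap pairs are all assumptions made for this example, since the abstract does not specify the scoring details.

```python
from itertools import combinations


def sum_of_pairs(alignment, match=1, mismatch=-1, gap=-2):
    """Score an alignment given as equal-length strings with '-' for gaps.

    Every pair of rows is compared column by column and the per-pair
    scores are summed. The scoring values are hypothetical placeholders.
    """
    if len({len(row) for row in alignment}) > 1:
        raise ValueError("all aligned rows must have the same length")

    total = 0
    for column in zip(*alignment):
        for a, b in combinations(column, 2):
            if a == "-" and b == "-":
                continue  # assumption: gap-gap pairs contribute nothing
            if a == "-" or b == "-":
                total += gap
            elif a == b:
                total += match
            else:
                total += mismatch
    return total


# Two hypothetical candidate alignments of the same three sequences.
candidate_a = ["AC-T", "ACGT", "A-GT"]
candidate_b = ["ACT-", "ACGT", "AGT-"]

# A genetic algorithm would treat the higher-scoring candidate as fitter.
fittest = max([candidate_a, candidate_b], key=sum_of_pairs)
print(sum_of_pairs(candidate_a), sum_of_pairs(candidate_b), fittest)
```

In a genetic-algorithm setting, a function of this kind would be evaluated for every alignment in the population, and the scores would guide which alignments are kept, combined, or mutated in the next generation. Any other objective function could be substituted without changing that outer loop.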
Similar articles
- RAGA: RNA sequence alignment by genetic algorithm. Nucleic Acids Res. 1997 Nov 15;25(22):4570-80. doi: 10.1093/nar/25.22.4570. PMID: 9358168. Free PMC article.
- COFFEE: an objective function for multiple sequence alignments. Bioinformatics. 1998 Jun;14(5):407-22. doi: 10.1093/bioinformatics/14.5.407. PMID: 9682054.
- Robust sequence alignment using evolutionary rates coupled with an amino acid substitution matrix. BMC Bioinformatics. 2015 Aug 14;16:255. doi: 10.1186/s12859-015-0688-8. PMID: 26269100. Free PMC article.
- An adaptive and iterative algorithm for refining multiple sequence alignment. Comput Biol Chem. 2004 Apr;28(2):141-8. doi: 10.1016/j.compbiolchem.2004.02.001. PMID: 15130542.
- Determination of reliable regions in protein sequence alignments. Protein Eng. 1990 Jul;3(7):565-9. doi: 10.1093/protein/3.7.565. PMID: 2217130. Review.
Cited by
- A comprehensive benchmark study of multiple sequence alignment methods: current challenges and future perspectives. PLoS One. 2011 Mar 31;6(3):e18093. doi: 10.1371/journal.pone.0018093. PMID: 21483869. Free PMC article.
- A Genetic Algorithm for Universal Optimization of Ultrasensitive Surface Plasmon Resonance Sensors with 2D Materials. ACS Omega. 2023 May 26;8(23):20792-20800. doi: 10.1021/acsomega.3c01387. eCollection 2023 Jun 13. PMID: 37323412. Free PMC article.
- Genetic algorithm learning as a robust approach to RNA editing site prediction. BMC Bioinformatics. 2006 Mar 16;7:145. doi: 10.1186/1471-2105-7-145. PMID: 16542417. Free PMC article.
- Improvement of alignment accuracy utilizing sequentially conserved motifs. BMC Bioinformatics. 2004 Oct 28;5:167. doi: 10.1186/1471-2105-5-167. PMID: 15509307. Free PMC article.
- Comparative protein structure modeling by iterative alignment, model building and model assessment. Nucleic Acids Res. 2003 Jul 15;31(14):3982-92. doi: 10.1093/nar/gkg460. PMID: 12853614. Free PMC article.