PASS: a program to align short sequences
- PMID: 19218350
- DOI: 10.1093/bioinformatics/btp087
Abstract
Summary: Standard DNA alignment programs cannot cope with the volume of data produced by new-generation DNA sequencers. To address this problem, we developed PASS, with the aim of improving execution time and sensitivity relative to other available programs. PASS performs fast gapped and ungapped alignments of short DNA sequences against a reference DNA, typically a genomic sequence. It is designed to handle very large numbers of reads, such as those generated by Solexa, SOLiD or 454 technologies. The algorithm is based on a data structure that holds in RAM an index of the genomic positions of 'seed' words (typically 11 and 12 bases), together with an index of the precomputed scores of short words (typically seven and eight bases) aligned against each other. After building the genomic index, the program scans every query sequence in three steps: (1) it finds matching seed words in the genome; (2) for every match, it checks the precomputed alignment of the short flanking regions; (3) if the match passes step 2, it performs an exact dynamic alignment of a narrow region around the match. The program performs well in both sensitivity and speed. For instance, alignment is hundreds of times faster than with BLAST and several times faster than with SOAP, especially when gaps are allowed. Furthermore, PASS has higher sensitivity than the other available programs.
Availability and implementation: Source code and binaries are freely available for download at http://pass.cribi.unipd.it. The program is implemented in C++ and supported on Linux and Windows.
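The three-step scan described in the Summary can be illustrated with a small sketch. This is not the PASS source code (which is written in C++); it is a minimal Python illustration under several assumptions. The function names, the parameters (`seed_len`, `flank_len`, `max_flank_mm`, `max_edits`, `pad`) and their toy default values are hypothetical, a memoized mismatch count stands in for the precomputed short-word score table, and a plain edit-distance recurrence stands in for the exact dynamic alignment.

```python
from collections import defaultdict
from functools import lru_cache


def build_seed_index(genome, seed_len):
    """Map each seed word to every genome position where it starts."""
    index = defaultdict(list)
    for pos in range(len(genome) - seed_len + 1):
        index[genome[pos:pos + seed_len]].append(pos)
    return index


@lru_cache(maxsize=None)
def flank_mismatches(a, b):
    """Assumed stand-in for the precomputed short-word score table:
    a memoized mismatch count over the overlapping prefix of a and b."""
    return sum(x != y for x, y in zip(a, b))


def edit_distance_in_window(read, window):
    """Dynamic programming: fewest edits (mismatches and gaps) needed to
    place the whole read anywhere inside the genome window."""
    prev = [0] * (len(window) + 1)  # the read may start at any window column
    for i, r in enumerate(read, 1):
        cur = [i]
        for j, w in enumerate(window, 1):
            cur.append(min(prev[j - 1] + (r != w),  # match or mismatch
                           prev[j] + 1,             # read base against a gap
                           cur[j - 1] + 1))         # genome base against a gap
        prev = cur
    return min(prev)  # the read may end at any window column


def align_read(read, genome, index, seed_len=4, flank_len=3,
               max_flank_mm=1, max_edits=2, pad=2):
    """Return (implied_start, edits) pairs, best first."""
    hits = {}
    for offset in range(len(read) - seed_len + 1):
        seed = read[offset:offset + seed_len]
        # Step 1: look up genome positions of the seed word.
        for gpos in index.get(seed, ()):
            start = gpos - offset  # read start implied by an ungapped placement
            if start in hits:
                continue
            # Step 2: cheap filter on the short regions flanking the seed.
            left = read[max(0, offset - flank_len):offset]
            g_left = genome[max(0, gpos - flank_len):gpos]
            right = read[offset + seed_len:offset + seed_len + flank_len]
            g_right = genome[gpos + seed_len:gpos + seed_len + flank_len]
            # The left flank is reversed so the comparison is anchored at the seed.
            mm = (flank_mismatches(left[::-1], g_left[::-1])
                  + flank_mismatches(right, g_right))
            if mm > max_flank_mm:
                continue
            # Step 3: exact dynamic alignment in a narrow window around the hit.
            window = genome[max(0, start - pad):start + len(read) + pad]
            edits = edit_distance_in_window(read, window)
            if edits <= max_edits:
                hits[start] = edits
    return sorted(hits.items(), key=lambda h: (h[1], h[0]))
```

In use, `build_seed_index` would be called once per reference and `align_read` once per read. The toy defaults are deliberately much shorter than the 11 to 12 base seeds and 7 to 8 base flanking words mentioned in the Summary, so the sketch can be tried on a reference of a few dozen bases. The order of the steps is the point of the design: the index lookup and flank filter are cheap, so the comparatively expensive dynamic alignment runs only on candidates that survive both.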