BLAT--the BLAST-like alignment tool
- PMID: 11932250
- PMCID: PMC187518
- DOI: 10.1101/gr.229202
BLAT--the BLAST-like alignment tool
Abstract
Analyzing vertebrate genomes requires rapid mRNA/DNA and cross-species protein alignments. A new tool, BLAT, is more accurate and 500 times faster than popular existing tools for mRNA/DNA alignments and 50 times faster for protein alignments at sensitivity settings typically used when comparing vertebrate sequences. BLAT's speed stems from an index of all nonoverlapping K-mers in the genome. This index fits inside the RAM of inexpensive computers, and need only be computed once for each genome assembly. BLAT has several major stages. It uses the index to find regions in the genome likely to be homologous to the query sequence. It performs an alignment between homologous regions. It stitches together these aligned regions (often exons) into larger alignments (typically genes). Finally, BLAT revisits small internal exons possibly missed at the first stage and adjusts large gap boundaries that have canonical splice sites where feasible. This paper describes how BLAT was optimized. Effects on speed and sensitivity are explored for various K-mer sizes, mismatch schemes, and number of required index matches. BLAT is compared with other alignment programs on various test sets and then used in several genome-wide applications. http://genome.ucsc.edu hosts a web-based BLAT server for the human genome.
Figures
Similar articles
-
Using BLAT to find sequence similarity in closely related genomes.Curr Protoc Bioinformatics. 2012 Mar;Chapter 10:10.8.1-10.8.24. doi: 10.1002/0471250953.bi1008s37. Curr Protoc Bioinformatics. 2012. PMID: 22389010 Free PMC article.
-
pblat: a multithread blat algorithm speeding up aligning sequences to genomes.BMC Bioinformatics. 2019 Jan 15;20(1):28. doi: 10.1186/s12859-019-2597-8. BMC Bioinformatics. 2019. PMID: 30646844 Free PMC article.
-
Rapid detection and curation of conserved DNA via enhanced-BLAT and EvoPrinterHD analysis.BMC Genomics. 2008 Feb 28;9:106. doi: 10.1186/1471-2164-9-106. BMC Genomics. 2008. PMID: 18307801 Free PMC article.
-
Estimating overannotation across prokaryotic genomes using BLAST+, UBLAST, LAST and BLAT.BMC Res Notes. 2014 Sep 16;7:651. doi: 10.1186/1756-0500-7-651. BMC Res Notes. 2014. PMID: 25228073 Free PMC article.
-
Iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases.Bioinformatics. 2000 Nov;16(11):988-1002. doi: 10.1093/bioinformatics/16.11.988. Bioinformatics. 2000. PMID: 11159310
Cited by
-
Reverse-transcribed SARS-CoV-2 RNA can integrate into the genome of cultured human cells and can be expressed in patient-derived tissues.Proc Natl Acad Sci U S A. 2021 May 25;118(21):e2105968118. doi: 10.1073/pnas.2105968118. Proc Natl Acad Sci U S A. 2021. PMID: 33958444 Free PMC article.
-
Integrated analysis of whole genome and transcriptome sequencing reveals diverse transcriptomic aberrations driven by somatic genomic changes in liver cancers.PLoS One. 2014 Dec 19;9(12):e114263. doi: 10.1371/journal.pone.0114263. eCollection 2014. PLoS One. 2014. PMID: 25526364 Free PMC article.
-
An Rtf2 Domain-Containing Protein Influences Pre-mRNA Splicing and Is Essential for Embryonic Development in Arabidopsis thaliana.Genetics. 2015 Jun;200(2):523-35. doi: 10.1534/genetics.115.176438. Epub 2015 Mar 27. Genetics. 2015. PMID: 25819795 Free PMC article.
-
The genomes of two key bumblebee species with primitive eusocial organization.Genome Biol. 2015 Apr 24;16(1):76. doi: 10.1186/s13059-015-0623-3. Genome Biol. 2015. PMID: 25908251 Free PMC article.
-
Wolfberry genomes and the evolution of Lycium (Solanaceae).Commun Biol. 2021 Jun 3;4(1):671. doi: 10.1038/s42003-021-02152-8. Commun Biol. 2021. PMID: 34083720 Free PMC article.
References
-
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. - PubMed
-
- Chao KM, Pearson WR, Miller W. Aligning two sequences within a specified diagonal band. Comput Appl Biosci. 1992;8:481–487. - PubMed
-
- Dunham I, Shimizu N, Roe BA, Chissoe S, Hunt AR, Collins JE, Bruskiewich R, Beare DM, Clamp M, Smink LJ, et al. The DNA sequence of human chromosome 22. Nature. 1999;402:489–495. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous