Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Nov 15;32(22):3507-3509.
doi: 10.1093/bioinformatics/btw490. Epub 2016 Jul 27.

WALT: fast and accurate read mapping for bisulfite sequencing

Affiliations

WALT: fast and accurate read mapping for bisulfite sequencing

Haifeng Chen et al. Bioinformatics. .

Abstract

Whole-genome bisulfite sequencing (WGBS) has emerged as the gold-standard technique in genome-scale studies of DNA methylation. Mapping reads from WGBS requires unique considerations that make the process more time-consuming than in other sequencing applications. Typical WGBS data sets contain several hundred million reads, adding to this analysis challenge. We present the WALT tool for mapping WGBS reads. WALT uses a strategy of hashing periodic spaced seeds, which leads to significant speedup compared with the most efficient methods currently available. Although many existing WGBS mappers slow down with read length, WALT improves in speed. Importantly, these speed gains do not sacrifice accuracy.

Availability and implementation: WALT is available under the GPL v3 license, and downloadable from https://github.com/smithlabcode/walt.

Contact: andrewds@usc.edu or tingchen@usc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Runtime as a function of read length

Similar articles

Cited by

References

    1. Bock C. (2012) Analysing and interpreting DNA methylation data. Nat. Rev. Gen., 13, 705–719. - PubMed
    1. Chen Y. et al. (2009) PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds. Bioinformatics, 25, 2514–2521. - PMC - PubMed
    1. Fortin J.P. et al. (2014) Functional normalization of 450k methylation array data improves replication in large cancer studies. Genome Biol., 15, 503. - PMC - PubMed
    1. Hach F. et al. (2010) mrsFAST: a cache-oblivious algorithm for short-read mapping. Nat. Methods, 7, 576–577. - PMC - PubMed
    1. Krueger F., Andrews S.R. (2011) Bismark: a flexible aligner and methylation caller for bisulfite-seq applications. Bioinformatics, 27, 1571–1572. - PMC - PubMed