AlienTrimmer: a tool to quickly and accurately trim off multiple short contaminant sequences from high-throughput sequencing reads
- PMID: 23912058
- DOI: 10.1016/j.ygeno.2013.07.011
Abstract
Contaminant oligonucleotide sequences such as primers and adapters can occur at both ends of high-throughput sequencing (HTS) reads. AlienTrimmer was developed to detect and remove such contaminants. Based on the decomposition of specified alien nucleotide sequences into k-mers, AlienTrimmer determines whether such alien k-mers occur at one or both read ends by using a simple polynomial algorithm. As a result, AlienTrimmer can process typical HTS single- or paired-end files containing millions of reads in several minutes with very low computing resources. Based on the analysis of both simulated and real-case Illumina®, 454™ and Ion Torrent™ read data, we show that AlienTrimmer performs with excellent accuracy and speed in comparison with other trimming tools. The program is freely available at ftp://ftp.pasteur.fr/pub/gensoft/projects/AlienTrimmer/.
Keywords: Adapter oligonucleotides; High-throughput sequencing; Polynomial algorithm; Raw read trimming; Short contaminant sequence; k-mer decomposition.
© 2013.
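The k-mer idea described in the abstract can be illustrated with a minimal Python sketch. This is not AlienTrimmer's actual algorithm, which the abstract does not detail; it is a naive exact-match heuristic written under stated assumptions. The function names (`kmers`, `build_alien_kmers`, `trim_read`), the value k = 5, and the example adapter and read sequences are all hypothetical and chosen only for illustration. The sketch marks every read position covered by an alien k-mer, then cuts the contiguous covered run at each end of the read.

```python
def kmers(seq, k):
    """Return the set of all length-k substrings (k-mers) of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def build_alien_kmers(aliens, k):
    """Collect the k-mers of every contaminant (alien) sequence."""
    alien_kmers = set()
    for alien in aliens:
        alien_kmers |= kmers(alien.upper(), k)
    return alien_kmers


def trim_read(read, alien_kmers, k):
    """Return the half-open interval (start, end) of the read to keep.

    Assumption: exact k-mer matches only, and only the contiguous
    alien-covered run touching each read end is removed. A real
    trimmer would also tolerate mismatches and low-quality bases.
    """
    n = len(read)
    covered = [False] * n
    # Mark every position that lies inside an alien k-mer occurrence.
    for i in range(n - k + 1):
        if read[i:i + k].upper() in alien_kmers:
            for j in range(i, i + k):
                covered[j] = True
    # Skip the covered run at the 5' end.
    start = 0
    while start < n and covered[start]:
        start += 1
    # Skip the covered run at the 3' end.
    end = n
    while end > start and covered[end - 1]:
        end -= 1
    return start, end


if __name__ == "__main__":
    k = 5                                   # hypothetical k-mer length
    adapter = "AGATCGGAAGAGC"               # illustrative adapter fragment
    insert = "ACGTTGCATGCA"                 # illustrative genomic insert
    alien_kmers = build_alien_kmers([adapter], k)
    read = adapter + insert + adapter       # contaminated at both ends
    start, end = trim_read(read, alien_kmers, k)
    print(read[start:end])
```

Building the alien k-mer set once and looking up each read k-mer in it keeps the per-read cost proportional to read length times k, which is consistent in spirit with the low resource use the abstract reports, though the real tool's data structures may differ.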