Scaffolding pre-assembled contigs using SSPACE
- PMID: 21149342
- DOI: 10.1093/bioinformatics/btq683
Scaffolding pre-assembled contigs using SSPACE
Abstract
De novo assembly tools play a main role in reconstructing genomes from next-generation sequencing (NGS) data and usually yield a number of contigs. Using paired-read sequencing data it is possible to assess the order, distance and orientation of contigs and combine them into so-called scaffolds. Although the latter process is a crucial step in finishing genomes, scaffolding algorithms are often built-in functions in de novo assembly tools and cannot be independently controlled. We here present a new tool, called SSPACE, which is a stand-alone scaffolder of pre-assembled contigs using paired-read data. Main features are: a short runtime, multiple library input of paired-end and/or mate pair datasets and possible contig extension with unmapped sequence reads. SSPACE shows promising results on both prokaryote and eukaryote genomic testsets where the amount of initial contigs was reduced by at least 75%.
Similar articles
-
gapFinisher: A reliable gap filling pipeline for SSPACE-LongRead scaffolder output.PLoS One. 2019 Sep 9;14(9):e0216885. doi: 10.1371/journal.pone.0216885. eCollection 2019. PLoS One. 2019. PMID: 31498807 Free PMC article.
-
ELOPER: elongation of paired-end reads as a pre-processing tool for improved de novo genome assembly.Bioinformatics. 2013 Jun 1;29(11):1455-7. doi: 10.1093/bioinformatics/btt169. Epub 2013 Apr 19. Bioinformatics. 2013. PMID: 23603334
-
Multi-CAR: a tool of contig scaffolding using multiple references.BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):469. doi: 10.1186/s12859-016-1328-7. BMC Bioinformatics. 2016. PMID: 28155633 Free PMC article.
-
Sequence assembly using next generation sequencing data--challenges and solutions.Sci China Life Sci. 2014 Nov;57(11):1140-8. doi: 10.1007/s11427-014-4752-9. Epub 2014 Oct 17. Sci China Life Sci. 2014. PMID: 25326069 Review.
-
A comprehensive review of scaffolding methods in genome assembly.Brief Bioinform. 2021 Sep 2;22(5):bbab033. doi: 10.1093/bib/bbab033. Brief Bioinform. 2021. PMID: 33634311 Review.
Cited by
-
Comparative and phylogenetic analysis of the chloroplast genomes of four commonly used medicinal cultivars of Chrysanthemums morifolium.BMC Plant Biol. 2024 Oct 22;24(1):992. doi: 10.1186/s12870-024-05679-0. BMC Plant Biol. 2024. PMID: 39434004 Free PMC article.
-
Towards the Description of the Genome Catalogue of Pseudomonas sp. Strain M1.Genome Announc. 2013 Jan;1(1):e00146-12. doi: 10.1128/genomeA.00146-12. Epub 2013 Feb 7. Genome Announc. 2013. PMID: 23405299 Free PMC article.
-
Genomic analysis reveals hidden biodiversity within colugos, the sister group to primates.Sci Adv. 2016 Aug 10;2(8):e1600633. doi: 10.1126/sciadv.1600633. eCollection 2016 Aug. Sci Adv. 2016. PMID: 27532052 Free PMC article.
-
Reference-assisted chromosome assembly.Proc Natl Acad Sci U S A. 2013 Jan 29;110(5):1785-90. doi: 10.1073/pnas.1220349110. Epub 2013 Jan 10. Proc Natl Acad Sci U S A. 2013. PMID: 23307812 Free PMC article.
-
Genome sequencing of the perciform fish Larimichthys crocea provides insights into molecular and genetic mechanisms of stress adaptation.PLoS Genet. 2015 Apr 2;11(4):e1005118. doi: 10.1371/journal.pgen.1005118. eCollection 2015 Apr. PLoS Genet. 2015. PMID: 25835551 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources