MetaSim: a sequencing simulator for genomics and metagenomics
- PMID: 18841204
- PMCID: PMC2556396
- DOI: 10.1371/journal.pone.0003373
MetaSim: a sequencing simulator for genomics and metagenomics
Abstract
Background: The new research field of metagenomics is providing exciting insights into various, previously unclassified ecological systems. Next-generation sequencing technologies are producing a rapid increase of environmental data in public databases. There is great need for specialized software solutions and statistical methods for dealing with complex metagenome data sets.
Methodology/principal findings: To facilitate the development and improvement of metagenomic tools and the planning of metagenomic projects, we introduce a sequencing simulator called MetaSim. Our software can be used to generate collections of synthetic reads that reflect the diverse taxonomical composition of typical metagenome data sets. Based on a database of given genomes, the program allows the user to design a metagenome by specifying the number of genomes present at different levels of the NCBI taxonomy, and then to collect reads from the metagenome using a simulation of a number of different sequencing technologies. A population sampler optionally produces evolved sequences based on source genomes and a given evolutionary tree.
Conclusions/significance: MetaSim allows the user to simulate individual read datasets that can be used as standardized test scenarios for planning sequencing projects or for benchmarking metagenomic software.
Conflict of interest statement
Figures
Similar articles
-
MEGAN analysis of metagenomic data.Genome Res. 2007 Mar;17(3):377-86. doi: 10.1101/gr.5969107. Epub 2007 Jan 25. Genome Res. 2007. PMID: 17255551 Free PMC article.
-
Use of simulated data sets to evaluate the fidelity of metagenomic processing methods.Nat Methods. 2007 Jun;4(6):495-500. doi: 10.1038/nmeth1043. Epub 2007 Apr 29. Nat Methods. 2007. PMID: 17468765
-
Megx.net--database resources for marine ecological genomics.Nucleic Acids Res. 2006 Jan 1;34(Database issue):D390-3. doi: 10.1093/nar/gkj070. Nucleic Acids Res. 2006. PMID: 16381894 Free PMC article.
-
Classification of metagenomic sequences: methods and challenges.Brief Bioinform. 2012 Nov;13(6):669-81. doi: 10.1093/bib/bbs054. Epub 2012 Sep 8. Brief Bioinform. 2012. PMID: 22962338 Review.
-
[A review on the bioinformatics pipelines for metagenomic research].Dongwuxue Yanjiu. 2012 Dec;33(6):574-85. doi: 10.3724/SP.J.1141.2012.06574. Dongwuxue Yanjiu. 2012. PMID: 23266976 Review. Chinese.
Cited by
-
MBBC: an efficient approach for metagenomic binning based on clustering.BMC Bioinformatics. 2015 Feb 5;16:36. doi: 10.1186/s12859-015-0473-8. BMC Bioinformatics. 2015. PMID: 25652152 Free PMC article.
-
Comparison of metagenomic samples using sequence signatures.BMC Genomics. 2012 Dec 27;13:730. doi: 10.1186/1471-2164-13-730. BMC Genomics. 2012. PMID: 23268604 Free PMC article.
-
Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools.BMC Bioinformatics. 2018 Aug 30;19(1):309. doi: 10.1186/s12859-018-2320-1. BMC Bioinformatics. 2018. PMID: 30165813 Free PMC article.
-
Taxonomic classification of metagenomic shotgun sequences with CARMA3.Nucleic Acids Res. 2011 Aug;39(14):e91. doi: 10.1093/nar/gkr225. Epub 2011 May 17. Nucleic Acids Res. 2011. PMID: 21586583 Free PMC article.
-
NeSSM: a Next-generation Sequencing Simulator for Metagenomics.PLoS One. 2013 Oct 4;8(10):e75448. doi: 10.1371/journal.pone.0075448. eCollection 2013. PLoS One. 2013. PMID: 24124490 Free PMC article.
References
-
- Tringe SG, von Mering C, Kobayashi A, Salamov AA, Chen K, et al. Comparative Metagenomics of Microbial Communities. Science. 2005;308:554–557. - PubMed
-
- Tyson GW, Chapman J, Hugenholtz P, Allen EE, Ram RJ, et al. Community structure and metabolism through reconstruction of microbial genomes from the environment. Nature. 2004;428:37–43. - PubMed
-
- Turnbaugh PJ, Ley RE, Mahowald MA, Magrini V, Mardis ER, et al. An obesity-associated gut microbiome with increased capacity for energy harvest. Nature. 2006;444:1027–1031. - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources