MetaSim: a sequencing simulator for genomics and metagenomics
- PMID: 18841204
- PMCID: PMC2556396
- DOI: 10.1371/journal.pone.0003373
Abstract
Background: The new research field of metagenomics is providing exciting insights into a variety of previously unclassified ecological systems. Next-generation sequencing technologies are producing a rapid increase in environmental data in public databases. There is a great need for specialized software solutions and statistical methods for dealing with complex metagenome data sets.
Methodology/principal findings: To facilitate the development and improvement of metagenomic tools and the planning of metagenomic projects, we introduce a sequencing simulator called MetaSim. Our software can be used to generate collections of synthetic reads that reflect the diverse taxonomical composition of typical metagenome data sets. Based on a database of given genomes, the program allows the user to design a metagenome by specifying the number of genomes present at different levels of the NCBI taxonomy, and then to collect reads from the metagenome using a simulation of a number of different sequencing technologies. A population sampler optionally produces evolved sequences based on source genomes and a given evolutionary tree.
Conclusions/significance: MetaSim allows the user to simulate individual read datasets that can be used as standardized test scenarios for planning sequencing projects or for benchmarking metagenomic software.
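The workflow described in the abstract (choose genomes, assign abundances, then draw reads) can be illustrated with a small sketch. This is not MetaSim's implementation. The function name `sample_reads`, its parameters, the weighting of each genome by abundance times length, and the uniform substitution error model are all assumptions made for illustration. MetaSim itself simulates specific sequencing technologies, which this sketch does not attempt.

```python
import random


def sample_reads(genomes, abundances, n_reads, read_len, error_rate=0.0, seed=0):
    """Draw fixed-length reads from a toy metagenome (hypothetical helper).

    genomes:    dict mapping genome name -> DNA string
    abundances: dict mapping genome name -> relative copy number
    Returns a list of (genome_name, start_position, read_sequence) tuples.
    """
    rng = random.Random(seed)
    names = list(genomes)
    for name in names:
        if len(genomes[name]) < read_len:
            raise ValueError(f"genome {name!r} is shorter than read_len")

    # Assumption: the chance of a read coming from a genome is proportional
    # to its relative abundance times its length.
    weights = [abundances[name] * len(genomes[name]) for name in names]

    reads = []
    for _ in range(n_reads):
        name = rng.choices(names, weights=weights, k=1)[0]
        seq = genomes[name]
        start = rng.randrange(len(seq) - read_len + 1)
        fragment = seq[start:start + read_len]

        # Assumption: a simple uniform substitution model stands in for a
        # technology-specific error model.
        bases = []
        for base in fragment:
            if rng.random() < error_rate:
                base = rng.choice([b for b in "ACGT" if b != base])
            bases.append(base)
        reads.append((name, start, "".join(bases)))
    return reads


if __name__ == "__main__":
    # Toy sequences, not real genomes.
    genomes = {"genome_A": "ACGT" * 50, "genome_B": "GGCATTAC" * 40}
    abundances = {"genome_A": 3.0, "genome_B": 1.0}
    for name, start, seq in sample_reads(
        genomes, abundances, n_reads=5, read_len=20, error_rate=0.01, seed=42
    ):
        print(name, start, seq)
```

Because each read carries its source genome and start position, the output can serve as ground truth when benchmarking a classifier or assembler, which is the use case the abstract names.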