Bioinformatics. 2015 Sep 15;31(18):2930-8.
doi: 10.1093/bioinformatics/btv317. Epub 2015 May 21.

Bayesian mixture analysis for metagenomic community profiling

Sofia Morfopoulou et al. Bioinformatics.

Abstract

Motivation: Deep sequencing of clinical samples is now an established tool for the detection of infectious pathogens, with direct medical applications. The large amount of data generated produces an opportunity to detect species even at very low levels, provided that computational tools can effectively profile the relevant metagenomic communities. Data interpretation is complicated by the fact that short sequencing reads can match multiple organisms and by the lack of completeness of existing databases, in particular for viral pathogens. Here we present metaMix, a Bayesian mixture model framework for resolving complex metagenomic mixtures. We show that the use of parallel Monte Carlo Markov chains for the exploration of the species space enables the identification of the set of species most likely to contribute to the mixture.
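The ambiguity described above, where one short read can match several organisms, is what a mixture model is meant to resolve. The following is a minimal sketch, not the metaMix implementation: it assumes the set of candidate species is already fixed and estimates their relative abundances with a plain EM algorithm. The function name `em_mixture_weights` and the input matrix `lik` (probability of each read under each species) are hypothetical, introduced only for illustration.

```python
import numpy as np


def em_mixture_weights(lik, n_iter=200, tol=1e-10):
    """Estimate mixture weights for a fixed set of species.

    lik[i, k] is an assumed, precomputed probability of read i under
    species k. Every row must contain at least one positive entry.
    """
    n_reads, n_species = lik.shape
    w = np.full(n_species, 1.0 / n_species)  # start from uniform abundances
    for _ in range(n_iter):
        # E-step: posterior probability that read i came from species k
        resp = lik * w
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: new weight is the average responsibility per species
        new_w = resp.mean(axis=0)
        converged = np.abs(new_w - w).max() < tol
        w = new_w
        if converged:
            break
    return w


# Toy input (made up): reads 0-2 are ambiguous between species 0 and 1,
# read 3 matches species 2 almost exclusively.
lik = np.array([
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.0],
    [0.5, 0.5, 0.0],
    [0.0, 0.1, 0.9],
])
weights = em_mixture_weights(lik)
```

In this sketch, ambiguous reads are shared fractionally between species according to the current abundance estimates, rather than being assigned to a single best hit. Choosing which species belong in the mixture at all is a separate model-selection problem, which the abstract says metaMix addresses with parallel Markov chains.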

Results: We demonstrate the greater accuracy of metaMix compared with relevant methods, particularly for profiling complex communities consisting of several related species. We designed metaMix specifically for the analysis of deep transcriptome sequencing datasets, with a focus on viral pathogen detection; however, the principles are generally applicable to all types of metagenomic mixtures.

Availability and implementation: metaMix is implemented as a user-friendly R package, freely available on CRAN: http://cran.r-project.org/web/packages/metaMix

Contact: sofia.morfopoulou.10@ucl.ac.uk

Supplementary information: Supplementary data are available at Bioinformatics online.

Figures

Fig. 1.
(a) Log-likelihood trace plot for single-chain MCMC and (b) for the PT chain at temperature T = 1. (c) Schematic of parallel tempering. Exchanges are attempted between chains of neighboring temperatures, where Chain 1 is at T1 = 1 and T1 < T2 < T3 < T4
Fig. 2.
Human clinical sample - novel virus. The reads (short lines) assigned by metaMix to Astrovirus VA1 are aligned to the genome. The longer lines represent the genes of the virus
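
The exchange step in the parallel tempering schematic of Fig. 1 can be sketched as follows. This is an illustrative sketch of the standard parallel tempering swap rule, not code from the metaMix package. It assumes each chain at temperature T targets the posterior raised to the power 1/T. The names `swap_log_ratio` and `attempt_swaps`, the temperature ladder, and the placeholder states are all hypothetical.

```python
import math
import random


def swap_log_ratio(logp_i, logp_j, temp_i, temp_j):
    """Log acceptance ratio for exchanging the states of chains i and j.

    logp_* are untempered log-posterior values of the current states.
    """
    return (1.0 / temp_i - 1.0 / temp_j) * (logp_j - logp_i)


def attempt_swaps(states, logps, temps, rng):
    """Try one exchange between each pair of neighboring temperatures.

    Modifies states and logps in place; returns the number of accepted swaps.
    """
    accepted = 0
    for i in range(len(temps) - 1):
        j = i + 1
        log_ratio = swap_log_ratio(logps[i], logps[j], temps[i], temps[j])
        # Metropolis rule: accept with probability min(1, exp(log_ratio))
        if log_ratio >= 0 or rng.random() < math.exp(log_ratio):
            states[i], states[j] = states[j], states[i]
            logps[i], logps[j] = logps[j], logps[i]
            accepted += 1
    return accepted


# Toy ladder (made up) with T1 = 1 < T2 < T3 < T4, as in the schematic.
temps = [1.0, 2.0, 4.0, 8.0]
states = ["A", "B", "C", "D"]        # placeholders for candidate species sets
logps = [-10.0, -8.0, -6.0, -4.0]    # hotter chains currently hold better states
accepted = attempt_swaps(states, logps, temps, random.Random(0))
```

In this toy example every hotter chain holds a better-scoring state than its colder neighbor, so each exchange is accepted with certainty. In general, a swap that would move a worse state into the colder chain is accepted only with probability exp(log_ratio).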
