Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jun 10;9(8):811-4.
doi: 10.1038/nmeth.2066.

Metagenomic microbial community profiling using unique clade-specific marker genes

Affiliations

Metagenomic microbial community profiling using unique clade-specific marker genes

Nicola Segata et al. Nat Methods. .

Abstract

Metagenomic shotgun sequencing data can identify microbes populating a microbial community and their proportions, but existing taxonomic profiling methods are inefficient for increasingly large data sets. We present an approach that uses clade-specific marker genes to unambiguously assign reads to microbial clades more accurately and >50× faster than current approaches. We validated our metagenomic phylogenetic analysis tool, MetaPhlAn, on terabases of short reads and provide the largest metagenomic profiling to date of the human gut. It can be accessed at http://huttenhower.sph.harvard.edu/metaphlan/.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1. Comparison of MetaPhlAn to existing methods
We used 10 total synthetic metagenomes to compare MetaPhlAn to PhymmBL, BLAST, theRITA pipeline, and NBC. a–b) Absolute and root mean squared (rms) errors with respect to 100 total organisms in one synthetic metagenome at the species (a) and class (b) level c) Correlations of inferred and true species abundances on 8 non-evenly distributed synthetic metagenomes;. d) Reads-per-second rates for the tested methods on single CPUs.
Fig. 2
Fig. 2. Composition of healthy vaginal microbiota
MetaPhlAn species and genus abundances and 16S phylotype abundances for 51 healthy vaginal microbiomes from the Human Microbiome Project. Samples were naively grouped by assigning each based on its dominant (>50%) Lactobacillus species or by the absence (<2%) of any Lactobacillus. For each cluster (named from I to V) we report averages across samples for all genera and species as inferred by MetaPhlAn and, for genera, as estimated by the combination of mothur and the RDP classifier (see Methods) applied to 16S rRNA gene sequences from the same specimen.
Fig. 3
Fig. 3. The gut microbiota in asymptomatic Western populations as inferred by MetaPhlAn on 224 samples combining the HMP and MetaHIT cohorts
a) A taxonomic cladogram reporting all clades present in one or both cohorts (≥0.5% abundance in ≥1 sample). Circle size is proportional to the log of average abundance; color represents relative enrichment of the most abundant taxa (≥1% average in ≥1 cohort) between the HMP (139) and MetaHIT (85, healthy only) populations. b–c) Genus- (b) and species-level (c) taxonomic profiles of the most abundant clades hierarchically clustered (average linkage) with the Bray-Curtis similarity reveal sets of samples with similar microbial community compositions. With the exception of the cluster dominated by genus Bacteroides (B. vulgatus and B. ovatus in particular), samples from both studies are present in all groups, confirming substantial consistency of the gut microbiota characterized by independent and geographically distant western-diet asymptomatic cohorts. Only species and genera with at least 7.5% abundances at the 95th percentile of their distribution are reported.

Comment in

Similar articles

Cited by

References

    1. DeLong EF. Nat Rev Microbiol. 2005;3:459–469. - PubMed
    1. Daniel R. Nat Rev Microbiol. 2005;3:470–478. - PubMed
    1. Methé B, et al. Nature. 2012 in press.
    1. Qin J, et al. Nature. 2010;464:59–65. - PMC - PubMed
    1. Ravel J, et al. P Natl Acad Sci USA. 2011;108:4680–4687. - PubMed

Publication types

Substances