Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 May 17:2024.05.14.594186.
doi: 10.1101/2024.05.14.594186.

ADAPT: Analysis of Microbiome Differential Abundance by Pooling Tobit Models

Affiliations

ADAPT: Analysis of Microbiome Differential Abundance by Pooling Tobit Models

Mukai Wang et al. bioRxiv. .

Abstract

Microbiome differential abundance analysis remains a challenging problem despite multiple methods proposed in the literature. The excessive zeros and compositionality of metagenomics data are two main challenges for differential abundance analysis. We propose a novel method called "analysis of differential abundance by pooling Tobit models" (ADAPT) to overcome these two challenges. ADAPT uniquely treats zero counts as left-censored observations to facilitate computation and enhance interpretation. ADAPT also encompasses a theoretically justified way of selecting non-differentially abundant microbiome taxa as a reference for hypothesis testing. We generate synthetic data using independent simulation frameworks to show that ADAPT has more consistent false discovery rate control and higher statistical power than competitors. We use ADAPT to analyze 16S rRNA sequencing of saliva samples and shotgun metagenomics sequencing of plaque samples collected from infants in the COHRA2 study. The results provide novel insights into the association between the oral microbiome and early childhood dental caries.

PubMed Disclaimer

Conflict of interest statement

9Competing Interests The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Illustration of ADAPT with a toy example.
(a) Three microbiome taxa (taxon 1, taxon 4, and taxon 7) are differentially abundant between two ecosystems. Neither the observed counts nor the relative abundances can be directly compared for differential abundance analysis. (b) ADAPT treats zero counts as left-censored at the detection limit (one in this case). ADAPT first calculates the fold change of relative abundances. It then selects a subset of taxa (taxon 2, 3, and 5) whose fold changes equal the median as reference taxa. After scaling the counts by the sum of three reference taxa, ADAPT can recover the DA taxa without false positives by comparing the normalized counts.
Fig. 2
Fig. 2. Simulation studies for comparing ADAPT with eight other differential abundance analysis methods.
We simulate synthetic metagenomics sequencing count data under two contrasting conditions using the SparseDOSSA framework. The number of samples is the same between the two conditions. We generate 500 replicates for each simulation setting and report the mean of performance metrics. (a) False positive rates (type I errors) of all methods except for ANCOM under simulation settings with no DA taxa. The total number of taxa is 500. The total sample size is 50 or 100. The average library size is the same (balanced) for two conditions at 104 or different (unbalanced) between two conditions (104 for one condition and 105 for the other). (b) False discovery rates and power under simulation settings with different proportions of DA taxa. The sample size is 100. The total number of taxa is 500. The proportion of DA taxa is 5%, 10%, 20%, or 30%. The average library size is 2 × 104 for both conditions. The average absolute abundance fold change of DA taxa is 5. The directions of absolute abundance changes of DA taxa may be balanced or unbalanced.
Fig. 3
Fig. 3. Microbiome differential abundance analysis between children who developed early childhood dental caries and those who did not.
(a) 38 out of 155 total amplicon sequence variants in the saliva samples collected at 12 months old are differentially abundant based on at least one method. ADAPT detects 27 DA ASVs. (b) 14 out of 590 taxa in the plaque samples collected between 36 and 60 months old are differentially abundant based on at least one method. ADAPT detects 12 DA taxa. (c) Volcano plot for DAA of saliva samples (d) Volcano plot for DAA of plaque samples

Similar articles

References

    1. Human Microbiome Project Consortium. Structure, function and diversity of the healthy human microbiome. Nature 486, 207–214 (2012). - PMC - PubMed
    1. Yatsunenko T. et al. Human gut microbiome viewed across age and geography. Nature 486, 222–227 (2012). - PMC - PubMed
    1. McDonald D. et al. American Gut: an Open Platform for Citizen Science Microbiome Research. mSystems 3 (2018). - PMC - PubMed
    1. Li H. Microbiome, Metagenomics, and High-Dimensional Compositional Data Analysis. Annual Review of Statistics and Its Application 2, 73–94 (2015).
    1. Nearing J. T. et al. Microbiome differential abundance methods produce different results across 38 datasets. Nature Communications 13, 342 (2022). - PMC - PubMed

Publication types