An improved nonparametric lower bound of species richness via a modified Good-Turing frequency formula
- PMID: 24945937
- DOI: 10.1111/biom.12200
Abstract
It is difficult to accurately estimate species richness if there are many almost undetectable species in a hyper-diverse community. Practically, an accurate lower bound for species richness is preferable to an inaccurate point estimator. The traditional nonparametric lower bound developed by Chao (1984, Scandinavian Journal of Statistics 11, 265-270) for individual-based abundance data uses only the information on the rarest species (the numbers of singletons and doubletons) to estimate the number of undetected species in samples. Applying a modified Good-Turing frequency formula, we derive an approximate formula for the first-order bias of this traditional lower bound. The approximate bias is estimated by using additional information (namely, the numbers of tripletons and quadrupletons). This approximate bias can be corrected, and an improved lower bound is thus obtained. The proposed lower bound is nonparametric in the sense that it is universally valid for any species abundance distribution. A similar type of improved lower bound can be derived for incidence data. We test our proposed lower bounds on simulated data sets generated from various species abundance models. Simulation results show that the proposed lower bounds always reduce bias over the traditional lower bounds and improve accuracy (as measured by mean squared error) when the heterogeneity of species abundances is relatively high. We also apply the proposed new lower bounds to real data for illustration and for comparisons with previously developed estimators.
Keywords: Abundance data; Good–Turing frequency formula; Incidence data; Species richness.
© 2014, The International Biometric Society.
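The abstract describes the two lower bounds only in words. As a rough illustration, the sketch below computes both from a vector of per-species abundance counts. The abstract does not state the formulas; the forms used here are the commonly cited ones (the Chao 1984 bound from singletons and doubletons, and an added correction term using tripletons and quadrupletons) and should be checked against the paper before use. The function names, the omission of small-sample factors involving the sample size, and the fallback to the traditional bound when there are no quadrupletons are assumptions made for this example.

```python
from collections import Counter


def frequency_counts(abundances):
    """Map k -> number of species observed exactly k times (f_k)."""
    return Counter(a for a in abundances if a > 0)


def chao1(abundances):
    """Traditional lower bound: S_obs + f1^2 / (2 * f2).

    Assumed form; when there are no doubletons, the usual
    bias-corrected variant f1 * (f1 - 1) / 2 is used instead.
    """
    f = frequency_counts(abundances)
    s_obs = sum(f.values())
    f1, f2 = f[1], f[2]
    if f2 > 0:
        return s_obs + f1 * f1 / (2 * f2)
    return s_obs + f1 * (f1 - 1) / 2


def improved_chao1(abundances):
    """Improved lower bound using tripletons (f3) and quadrupletons (f4).

    Assumed form:
        chao1 + f3 / (4 * f4) * max(f1 - f2 * f3 / (2 * f4), 0)
    Simplification for this sketch: if f4 == 0 the correction term is
    skipped and the traditional bound is returned unchanged.
    """
    f = frequency_counts(abundances)
    f1, f2, f3, f4 = f[1], f[2], f[3], f[4]
    base = chao1(abundances)
    if f4 == 0:
        return base
    correction = f3 / (4 * f4) * max(f1 - f2 * f3 / (2 * f4), 0.0)
    return base + correction


if __name__ == "__main__":
    # Hypothetical sample: 11 observed species with these abundances.
    sample = [1, 1, 1, 1, 2, 2, 3, 3, 4, 5, 10]
    print("traditional lower bound:", chao1(sample))
    print("improved lower bound:", improved_chao1(sample))
```

Because the correction term is never negative, the improved bound in this sketch is always at least as large as the traditional one, which matches the abstract's description of correcting the bias of the traditional bound.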