Estimating population diversity with CatchAll
Open Access
- 13 February 2012
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 28 (7) , 1045-1047
- https://doi.org/10.1093/bioinformatics/bts075
Abstract
Motivation: The massive data produced by next-generation sequencing require advanced statistical tools. We address estimating the total diversity or species richness in a population. To date, only relatively simple methods have been implemented in available software. There is a need for software employing modern, computationally intensive statistical analyses including error, goodness-of-fit and robustness assessments. Results: We present CatchAll, a fast, easy-to-use, platform-independent program that computes maximum likelihood estimates for finite-mixture models, weighted linear regression-based analyses and coverage-based non-parametric methods, along with outlier diagnostics. Given sample ‘frequency count’ data, CatchAll computes 12 different diversity estimates and applies a model-selection algorithm. CatchAll also derives discounted diversity estimates to adjust for possibly uncertain low-frequency counts. It is accompanied by an Excel-based graphics program. Availability: Free executable downloads for Linux, Windows and Mac OS, with manual and source code, at www.northeastern.edu/catchall. Contact:jab18@cornell.eduThis publication has 9 references indexed in Scilit:
- Antibiotics in Feed Induce Prophages in Swine Fecal MicrobiomesmBio, 2011
- ESTIMATING POPULATION DIVERSITY WITH UNRELIABLE LOW FREQUENCY COUNTSBiocomputing 2020, 2011
- Population size estimation based upon ratios of recapture probabilitiesThe Annals of Applied Statistics, 2011
- Community cyberinfrastructure for Advanced Microbial Ecology Research and Analysis: the CAMERA resourceNucleic Acids Research, 2010
- ESTIMATING THE NUMBER OF SPECIES WITH CATCHALLBiocomputing 2020, 2010
- Depicting more accurate pictures of protistan community complexity using pyrosequencing of hypervariable SSU rRNA gene regionsEnvironmental Microbiology, 2010
- Parametric Models for Estimating the Number of ClassesBiometrical Journal, 2008
- Estimating the Number of Classes via Sample CoverageJournal of the American Statistical Association, 1992
- Estimating the Number of Classes via Sample CoverageJournal of the American Statistical Association, 1992