Maximum Likelihood Estimation of the Negative Binomial Dispersion Parameter for Highly Overdispersed Data, with Applications to Infectious Diseases
Open Access
- 14 February 2007
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 2 (2) , e180
- https://doi.org/10.1371/journal.pone.0000180
Abstract
The negative binomial distribution is used commonly throughout biology as a model for overdispersed count data, with attention focused on the negative binomial dispersion parameter, k. A substantial literature exists on the estimation of k, but most attention has focused on datasets that are not highly overdispersed (i.e., those with k≥1), and the accuracy of confidence intervals estimated for k is typically not explored. This article presents a simulation study exploring the bias, precision, and confidence interval coverage of maximum-likelihood estimates of k from highly overdispersed distributions. In addition to exploring small-sample bias on negative binomial estimates, the study addresses estimation from datasets influenced by two types of event under-counting, and from disease transmission data subject to selection bias for successful outbreaks. Results show that maximum likelihood estimates of k can be biased upward by small sample size or under-reporting of zero-class events, but are not biased downward by any of the factors considered. Confidence intervals estimated from the asymptotic sampling variance tend to exhibit coverage below the nominal level, with overestimates of k comprising the great majority of coverage errors. Estimation from outbreak datasets does not increase the bias of k estimates, but can add significant upward bias to estimates of the mean. Because k varies inversely with the degree of overdispersion, these findings show that overestimation of the degree of overdispersion is very rare for these datasets.Keywords
This publication has 25 references indexed in Scilit:
- Bias‐Corrected Maximum Likelihood Estimator of the Negative Binomial Dispersion ParameterBiometrics, 2005
- Different Epidemic Curves for Severe Acute Respiratory Syndrome Reveal Similar Impacts of Control MeasuresAmerican Journal of Epidemiology, 2004
- Spatial modelling of individual-level parasite counts using the negative binomial distributionBiostatistics, 2000
- Confidence curves and improved exact confidence intervals for discrete distributionsThe Canadian Journal of Statistics / La Revue Canadienne de Statistique, 2000
- Linear model analysis of net catch data using the negative binomial distributionCanadian Journal of Fisheries and Aquatic Sciences, 1999
- Analysis of Frequency Count Data Using the Negative Binomial DistributionEcology, 1996
- Estimation of the Negative Binomial Parameter κ by Maximum Quasi -LikelihoodBiometrics, 1989
- The Negative Binomial DistributionJournal of the Royal Statistical Society: Series D (The Statistician), 1985
- Multistage Estimation Compared with Fixed-Sample-Size Estimation of the Negative Binomial Parameter kPublished by JSTOR ,1984
- Small Sample Comparison of Different Estimators of Negative Binomial ParametersBiometrics, 1977