Zero tolerance ecology: improving ecological inference by modelling the source of zero observations
Top Cited Papers
Open Access
- 6 October 2005
- journal article
- review article
- Published by Wiley in Ecology Letters
- Vol. 8 (11) , 1235-1246
- https://doi.org/10.1111/j.1461-0248.2005.00826.x
Abstract
A common feature of ecological data sets is their tendency to contain many zero values. Statistical inference based on such data are likely to be inefficient or wrong unless careful thought is given to how these zeros arose and how best to model them. In this paper, we propose a framework for understanding how zero‐inflated data sets originate and deciding how best to model them. We define and classify the different kinds of zeros that occur in ecological data and describe how they arise: either from ‘true zero’ or ‘false zero’ observations. After reviewing recent developments in modelling zero‐inflated data sets, we use practical examples to demonstrate how failing to account for the source of zero inflation can reduce our ability to detect relationships in ecological data and at worst lead to incorrect inference. The adoption of methods that explicitly model the sources of zero observations will sharpen insights and improve the robustness of ecological analyses.Keywords
This publication has 55 references indexed in Scilit:
- Modelling skewed data with many zeros: A simple approach combining ordinary and logistic regressionEnvironmental and Ecological Statistics, 2005
- A bivariate zero-inflated negative binomial regression model for count data with excess zerosEconomics Letters, 2003
- Estimating bird species richness: How should repeat surveys be organized in time?Austral Ecology, 2002
- Bayesian Measures of Model Complexity and FitJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- Zero‐inflated models for regression analysis of count data: a study of growth and developmentStatistics in Medicine, 2002
- Inferring the Absence of a Species: A Case Study of SnakesThe Journal of Wildlife Management, 2002
- Current approaches to modelling the environmental niche of eucalypts: implication for management of forest biodiversityForest Ecology and Management, 1996
- Modelling the abundance of rare species: statistical models for counts with extra zerosEcological Modelling, 1996
- Zero-Inflated Poisson Regression, with an Application to Defects in ManufacturingTechnometrics, 1992
- Specification and testing of some modified count data modelsJournal of Econometrics, 1986