On the Use of Zero-Inflated and Hurdle Models for Modeling Vaccine Adverse Event Count Data
- 1 August 2006
- journal article
- research article
- Published by Taylor & Francis in Journal of Biopharmaceutical Statistics
- Vol. 16 (4) , 463-481
- https://doi.org/10.1080/10543400600719384
Abstract
We compared several modeling strategies for vaccine adverse event count data in which the data are characterized by excess zeroes and heteroskedasticity. Count data are routinely modeled using Poisson and Negative Binomial (NB) regression but zero-inflated and hurdle models may be advantageous in this setting. Here we compared the fit of the Poisson, Negative Binomial (NB), zero-inflated Poisson (ZIP), zero-inflated Negative Binomial (ZINB), Poisson Hurdle (PH), and Negative Binomial Hurdle (NBH) models. In general, for public health studies, we may conceptualize zero-inflated models as allowing zeroes to arise from at-risk and not-at-risk populations. In contrast, hurdle models may be conceptualized as having zeroes only from an at-risk population. Our results illustrate, for our data, that the ZINB and NBH models are preferred but these models are indistinguishable with respect to fit. Choosing between the zero-inflated and hurdle modeling framework, assuming Poisson and NB models are inadequate because of excess zeroes, should generally be based on the study design and purpose. If the study's purpose is inference then modeling framework should be considered. For example, if the study design leads to count endpoints with both structural and sample zeroes then generally the zero-inflated modeling framework is more appropriate, while in contrast, if the endpoint of interest, by design, only exhibits sample zeroes (e.g., at-risk participants) then the hurdle model framework is generally preferred. Conversely, if the study's primary purpose it is to develop a prediction model then both the zero-inflated and hurdle modeling frameworks should be adequate.Keywords
This publication has 22 references indexed in Scilit:
- Accessing a New Medication in Germany: A Novel Approach to Assess a Health Insurance-Related BarrierAnnals of Epidemiology, 2005
- The decision-making process of health care utilization in MexicoHealth Policy, 2005
- The effect of a major cigarette price change on smoking behavior in california: a zero‐inflated negative binomial modelHealth Economics, 2004
- Modeling Count Data with Excess ZeroesSociological Methods & Research, 2003
- Zero‐inflated models for regression analysis of count data: a study of growth and developmentStatistics in Medicine, 2002
- Theory & Methods: Modelling Correlated Zero‐inflated Count DataAustralian & New Zealand Journal of Statistics, 2001
- Zero‐inflated Poisson regression with random effects to evaluate an occupational injury prevention programmeStatistics in Medicine, 2001
- Likelihood Ratio Tests for Model Selection and Non-Nested HypothesesEconometrica, 1989
- Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard ConditionsJournal of the American Statistical Association, 1987
- Specification and testing of some modified count data modelsJournal of Econometrics, 1986