Analyzing Incomplete Discrete Longitudinal Clinical Trial Data
Open Access
- 1 February 2006
- journal article
- Published by Institute of Mathematical Statistics in Statistical Science
- Vol. 21 (1)
- https://doi.org/10.1214/088342305000000322
Abstract
Commonly used methods to analyze incomplete longitudinal clinical trial data include complete case analysis (CC) and last observation carried forward (LOCF). However, such methods rest on strong assumptions, including missing completely at random (MCAR) for CC and unchanging profile after dropout for LOCF. Such assumptions are too strong to generally hold. Over the last decades, a number of full longitudinal data analysis methods have become available, such as the linear mixed model for Gaussian outcomes, that are valid under the much weaker missing at random (MAR) assumption. Such a method is useful, even if the scientific question is in terms of a single time point, for example, the last planned measurement occasion, and it is generally consistent with the intention-to-treat principle. The validity of such a method rests on the use of maximum likelihood, under which the missing data mechanism is ignorable as soon as it is MAR. In this paper, we will focus on non-Gaussian outcomes, such as binary, categorical or count data. This setting is less straightforward since there is no unambiguous counterpart to the linear mixed model. We first provide an overview of the various modeling frameworks for non-Gaussian longitudinal data, and subsequently focus on generalized linear mixed-effects models, on the one hand, of which the parameters can be estimated using full likelihood, and on generalized estimating equations, on the other hand, which is a nonlikelihood method and hence requires a modification to be valid under MAR. We briefly comment on the position of models that assume missingness not at random and argue they are most useful to perform sensitivity analysis. Our developments are underscored using data from two studies. While the case studies feature binary outcomes, the methodology applies equally well to other discrete-data settings, hence the qualifier ``discrete'' in the title.Comment: Published at http://dx.doi.org/10.1214/088342305000000322 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.orgKeywords
All Related Versions
This publication has 49 references indexed in Scilit:
- Analyzing incomplete longitudinal clinical trial dataBiostatistics, 2004
- Frequentist Model Average EstimatorsJournal of the American Statistical Association, 2003
- Type I Error Rates from Mixed Effects Model Repeated Measures Versus Fixed Effects Anova with Missing Values Imputed Via Last Observation Carried ForwardDrug Information Journal, 2001
- Likelihood based frequentist inference when data are missing at randomStatistical Science, 1998
- A comparison of the random-effects pattern mixture model with last-observation-carried-forward(locf) analysis in longitudinal clinical trials with dropoutsJournal of Biopharmaceutical Statistics, 1998
- A Saturated Model for Analyzing Exchangeable Binary Data: Applications to Clinical and Developmental Toxicity StudiesJournal of the American Statistical Association, 1995
- Handling “Don't Know” Survey Responses: The Case of the Slovenian PlebisciteJournal of the American Statistical Association, 1995
- Analysis of Semiparametric Regression Models for Repeated Outcomes in the Presence of Missing DataJournal of the American Statistical Association, 1995
- Marginal Modeling of Correlated Ordinal Data Using a Multivariate Plackett DistributionJournal of the American Statistical Association, 1994
- Simultaneously Modeling Joint and Marginal Distributions of Multivariate Categorical ResponsesJournal of the American Statistical Association, 1994