Analyzing Incomplete Discrete Longitudinal Clinical Trial Data

Open Access

1 February 2006

journal article
Published by Institute of Mathematical Statistics in Statistical Science

Vol. 21 (1)
https://doi.org/10.1214/088342305000000322

Abstract

Commonly used methods to analyze incomplete longitudinal clinical trial data include complete case analysis (CC) and last observation carried forward (LOCF). However, such methods rest on strong assumptions, including missing completely at random (MCAR) for CC and unchanging profile after dropout for LOCF. Such assumptions are too strong to generally hold. Over the last decades, a number of full longitudinal data analysis methods have become available, such as the linear mixed model for Gaussian outcomes, that are valid under the much weaker missing at random (MAR) assumption. Such a method is useful, even if the scientific question is in terms of a single time point, for example, the last planned measurement occasion, and it is generally consistent with the intention-to-treat principle. The validity of such a method rests on the use of maximum likelihood, under which the missing data mechanism is ignorable as soon as it is MAR. In this paper, we will focus on non-Gaussian outcomes, such as binary, categorical or count data. This setting is less straightforward since there is no unambiguous counterpart to the linear mixed model. We first provide an overview of the various modeling frameworks for non-Gaussian longitudinal data, and subsequently focus on generalized linear mixed-effects models, on the one hand, of which the parameters can be estimated using full likelihood, and on generalized estimating equations, on the other hand, which is a nonlikelihood method and hence requires a modification to be valid under MAR. We briefly comment on the position of models that assume missingness not at random and argue they are most useful to perform sensitivity analysis. Our developments are underscored using data from two studies. While the case studies feature binary outcomes, the methodology applies equally well to other discrete-data settings, hence the qualifier ``discrete'' in the title.Comment: Published at http://dx.doi.org/10.1214/088342305000000322 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org

Keywords

All Related Versions

Version 1, 2006-06-20, ArXiv

This publication has 49 references indexed in Scilit:

Analyzing incomplete longitudinal clinical trial data
Biostatistics, 2004
Frequentist Model Average Estimators
Journal of the American Statistical Association, 2003
Type I Error Rates from Mixed Effects Model Repeated Measures Versus Fixed Effects Anova with Missing Values Imputed Via Last Observation Carried Forward
Drug Information Journal, 2001
Likelihood based frequentist inference when data are missing at random
Statistical Science, 1998
A comparison of the random-effects pattern mixture model with last-observation-carried-forward(locf) analysis in longitudinal clinical trials with dropouts
Journal of Biopharmaceutical Statistics, 1998
A Saturated Model for Analyzing Exchangeable Binary Data: Applications to Clinical and Developmental Toxicity Studies
Journal of the American Statistical Association, 1995
Handling “Don't Know” Survey Responses: The Case of the Slovenian Plebiscite
Journal of the American Statistical Association, 1995
Analysis of Semiparametric Regression Models for Repeated Outcomes in the Presence of Missing Data
Journal of the American Statistical Association, 1995
Marginal Modeling of Correlated Ordinal Data Using a Multivariate Plackett Distribution
Journal of the American Statistical Association, 1994
Simultaneously Modeling Joint and Marginal Distributions of Multivariate Categorical Responses
Journal of the American Statistical Association, 1994