Marginal and Mixed-Effects Models in the Analysis of Human Papillomavirus Natural History Data
Open Access
- 1 January 2010
- journal article
- research article
- Published by American Association for Cancer Research (AACR) in Cancer Epidemiology, Biomarkers & Prevention
- Vol. 19 (1) , 159-169
- https://doi.org/10.1158/1055-9965.epi-09-0546
Abstract
Human papillomavirus (HPV) natural history has several characteristics that, at least from a statistical perspective, are not often encountered elsewhere in infectious disease and cancer research. There are, for example, multiple HPV types, and infection by each HPV type may be considered separate events. Although concurrent infections are common, the prevalence, incidence, and duration/persistence of each individual HPV can be separately measured. However, repeated measures involving the same subject tend to be correlated. The probability of detecting any given HPV type, for example, is greater among individuals who are currently positive for at least one other HPV type. Serial testing for HPV over time represents a second form of repeated measures. Statistical inferences that fail to take these correlations into account would be invalid. However, methods that do not use all the data would be inefficient. Marginal and mixed-effects models can address these issues but are not frequently used in HPV research. The current study provides an overview of these methods and then uses HPV data from a cohort of HIV-positive women to illustrate how they may be applied, and compare their results. The findings show the greater efficiency of these models compared with standard logistic regression and Cox models. Because mixed-effects models estimate subject-specific associations, they sometimes gave much higher effect estimates than marginal models, which estimate population-averaged associations. Overall, the results show that marginal and mixed-effects models are efficient for studying HPV natural history, but also highlight the importance of understanding how these models differ. Cancer Epidemiol Biomakers Prev; 19(1); 159–69Keywords
This publication has 29 references indexed in Scilit:
- Rapid Clearance of Human Papillomavirus and Implications for Clinical Focus on Persistent InfectionsJNCI Journal of the National Cancer Institute, 2008
- Natural History and Possible Reactivation of Human Papillomavirus in Human Immunodeficiency Virus–Positive WomenJNCI Journal of the National Cancer Institute, 2005
- Analysis of data with multiple sources of correlation in the framework of generalized estimating equationsStatistics in Medicine, 2004
- Design and sample size estimation in clinical trials with clustered survival times as the primary endpointStatistics in Medicine, 2003
- Do We Still Need a Cohort Study of Women with HIV Infection?Epidemiology, 1998
- Design and analysis considerations in a cohort study involving repeated measurement of both exposure and outcome: The association between genital papillomavirus infection and risk of cervical intraepithelial neoplasiaStatistics in Medicine, 1994
- General class of covariance structures for two or more repeated factors in longitudinal data analysisCommunications in Statistics - Theory and Methods, 1994
- Regression Analysis of Multivariate Incomplete Failure Time Data by Modeling Marginal DistributionsJournal of the American Statistical Association, 1989
- Regression Analysis of Multivariate Incomplete Failure Time Data by Modeling Marginal DistributionsJournal of the American Statistical Association, 1989
- Longitudinal data analysis using generalized linear modelsBiometrika, 1986