Latent Class Models for Joint Analysis of Longitudinal Biomarker and Event Process Data
- 1 March 2002
- journal article
- Published by Taylor & Francis in Journal of the American Statistical Association
- Vol. 97 (457), 53-65
- https://doi.org/10.1198/016214502753479220
Abstract
A retrospective substudy of the nutritional prevention of cancer (NPC) trials investigated the utility of longitudinally measured prostate-specific antigen (PSA) as a biomarker for subsequent onset of prostate cancer (PCa). Serial PSA levels were determined retrospectively from frozen blood samples that had been collected from all patients at successive clinic visits with the timing and the number of these visits highly variable. Diagnosis dates of all incident cases of PCa were recorded. Heterogeneity in PSA trajectories was observed that could not be fully explained by the usual linear mixed-effects model and measured covariates. Latent class models that incorporate both a longitudinal biomarker process and an event process offer a way to handle additional heterogeneity, to uncover distinct subpopulations, to incorporate correlated nonnormally distributed outcomes, and to classify individuals into risk classes. Our latent class joint model can aid the prediction of PCa probability given the longitudinal biomarker information available on an individual up to any date. The proposed model easily accommodates highly unbalanced longitudinal data and recurrent events. There are two levels of structure in the latent class joint model. First, the uncertainty of latent class membership is specified through a multinomial logistic model. Second, the class-specific marker trajectory and event process are specified parametrically and semiparametrically, under the assumption of conditional independence given the latent class membership. We use a likelihood approach to obtain parameter estimates via the EM algorithm. We fit the latent class joint model to the data from the NPC trials; four distinct subpopulations are identified that differ with regard to their PSA trajectories and risk for prostate cancer. Higher PSA level is significantly associated with increased risk of PCa, but appears to be conditionally independent once the latent classes are taken into account. 
Among the covariates, selenium supplementation and age at entry are statistically significant for various parts of the model. Assumptions, in particular the conditional independence between the longitudinal PSA biomarker and time to PCa diagnosis, are assessed.
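The two-level structure described in the abstract can be illustrated with a minimal sketch of the E-step for a single subject: the posterior probability of each latent class given that subject's marker and event data. This is not the authors' implementation. The function name `posterior_class_probs`, the linear-in-time class trajectories with independent Gaussian errors, and the constant class-specific hazards are all simplifying assumptions made for illustration. The abstract states only that class membership follows a multinomial logistic model and that the marker and event processes are conditionally independent given the class.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D array."""
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()

def posterior_class_probs(x, gamma, times, psa, beta, sigma,
                          event_time, event, hazard):
    """Posterior P(class k | data) for one subject (hypothetical sketch).

    x          : covariate vector, shape (p,)
    gamma      : multinomial-logit coefficients, shape (K, p)
    times, psa : visit times and marker values, shape (n,)
    beta       : class-specific intercept and slope, shape (K, 2) (assumed linear)
    sigma      : residual standard deviation (assumed common to all classes)
    event_time : follow-up time; event: 1 if the event was observed, else 0
    hazard     : class-specific constant hazards, shape (K,) (assumed exponential)
    """
    # Level 1: class membership probabilities from a multinomial logistic model.
    log_prior = np.log(softmax(gamma @ x))

    # Level 2a: class-specific Gaussian log-likelihood of the marker trajectory.
    mean = beta[:, [0]] + beta[:, [1]] * times[None, :]   # shape (K, n)
    resid = psa[None, :] - mean
    log_marker = (-0.5 * np.sum((resid / sigma) ** 2, axis=1)
                  - times.size * np.log(sigma * np.sqrt(2.0 * np.pi)))

    # Level 2b: class-specific event log-likelihood with right censoring.
    log_event = event * np.log(hazard) - hazard * event_time

    # Conditional independence given the class: log-likelihoods simply add.
    log_joint = log_prior + log_marker + log_event
    w = np.exp(log_joint - np.max(log_joint))
    return w / w.sum()

# Illustrative values only, not taken from the NPC data.
x = np.array([1.0, 0.5])
gamma = np.array([[0.0, 0.0], [0.2, -0.1]])
times = np.array([0.0, 1.0, 2.0])
psa = np.array([1.0, 1.5, 2.1])
beta = np.array([[1.0, 0.0], [1.0, 0.5]])   # flat class vs. rising class
post = posterior_class_probs(x, gamma, times, psa, beta, 0.3,
                             2.5, 0, np.array([0.02, 0.10]))
```

With these made-up inputs the rising marker trajectory puts most of the posterior weight on the second class. In a full EM fit, weights of this kind would be computed for every subject and then used in the M-step to update the class-membership, trajectory, and event-process parameters.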