Variable Selection for Semiparametric Mixed Models in Longitudinal Studies
- 17 March 2010
- journal article
- Published by Oxford University Press (OUP) in Biometrics
- Vol. 66 (1) , 79-88
- https://doi.org/10.1111/j.1541-0420.2009.01240.x
Abstract
SummaryWe propose a double‐penalized likelihood approach for simultaneous model selection and estimation in semiparametric mixed models for longitudinal data. Two types of penalties are jointly imposed on the ordinary log‐likelihood: the roughness penalty on the nonparametric baseline function and a nonconcave shrinkage penalty on linear coefficients to achieve model sparsity. Compared to existing estimation equation based approaches, our procedure provides valid inference for data with missing at random, and will be more efficient if the specified model is correct. Another advantage of the new procedure is its easy computation for both regression components and variance parameters. We show that the double‐penalized problem can be conveniently reformulated into a linear mixed model framework, so that existing software can be directly used to implement our method. For the purpose of model inference, we derive both frequentist and Bayesian variance estimation for estimated parametric and nonparametric components. Simulation is used to evaluate and compare the performance of our method to the existing ones. We then apply the new method to a real data set from a lactation study.Keywords
This publication has 24 references indexed in Scilit:
- One-step sparse estimates in nonconcave penalized likelihood modelsThe Annals of Statistics, 2008
- Tuning parameter selectors for the smoothly clipped absolute deviation methodBiometrika, 2007
- Regularization and Variable Selection Via the Elastic NetJournal of the Royal Statistical Society Series B: Statistical Methodology, 2005
- Estimation in a semiparametric model for longitudinal data with unspecified dependence structureBiometrika, 2002
- Inference in Generalized Additive Mixed Models by Using Smoothing SplinesJournal of the Royal Statistical Society Series B: Statistical Methodology, 1999
- Semiparametric Stochastic Mixed Models for Longitudinal DataJournal of the American Statistical Association, 1998
- The Performance of Cross-Validation and Maximum Likelihood Estimators of Spline Smoothing ParametersJournal of the American Statistical Association, 1991
- A Comparison of GCV and GML for Choosing the Smoothing Parameter in the Generalized Spline Smoothing ProblemThe Annals of Statistics, 1985
- Estimating the Dimension of a ModelThe Annals of Statistics, 1978
- Maximum Likelihood Approaches to Variance Component Estimation and to Related ProblemsJournal of the American Statistical Association, 1977