Variable Selection for Cox's Proportional Hazards Model and Frailty Model
Open Access
- 1 February 2002
- journal article
- Published by Institute of Mathematical Statistics in The Annals of Statistics
- Vol. 30 (1) , 74-99
- https://doi.org/10.1214/aos/1015362185
Abstract
A class of variable selection procedures for parametric models via nonconcave penalized likelihood was proposed in Fan and Li (2001a). It has been shown there that the resulting procedures perform as well as if the subset of significant variables were known in advance. Such a property is called an oracle property. The proposed procedures were illustrated in the context of linear regression, robust linear regression and generalized linear models. In this paper, the nonconcave penalized likelihood approach is extended further to the Cox proportional hazards model and the Cox proportional hazards frailty model, two commonly used semi-parametric models in survival analysis. As a result, new variable selection procedures for these two models are proposed. It is demonstrated how the rates of convergence depend on the regularization parameter in the penalty function. Further, with a proper choice of the regularization parameter and the penalty function, the proposed estimators possess an oracle property. Standard error formulae are derived and their accuracies are empirically tested. Simulation studies show that the proposed procedures are more stable in prediction and more effective in computation than the best subset variable selection, and they reduce model complexity as effectively as the best subset variable selection. Compared with the LASSO, which is the penalized likelihood method with the $L_1$-penalty, proposed by Tibshirani, the newly proposed approaches have better theoretical properties and finite sample performance.
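To make the idea of a penalized partial likelihood concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not name the penalty; the sketch assumes the SCAD penalty of Fan and Li (2001a) with the conventional default `a = 3.7`, and contrasts it with the $L_1$ penalty used by the LASSO. It also assumes no tied event times and that the objective takes the form "log partial likelihood minus `n` times the summed penalty". All function names are hypothetical.

```python
import math


def scad_penalty(theta, lam, a=3.7):
    """SCAD penalty (assumed form): linear near zero, quadratic blend, then flat."""
    t = abs(theta)
    if t <= lam:
        return lam * t
    if t <= a * lam:
        return -(t * t - 2 * a * lam * t + lam * lam) / (2 * (a - 1))
    return (a + 1) * lam * lam / 2


def l1_penalty(theta, lam):
    """L1 (LASSO) penalty: grows linearly without bound."""
    return lam * abs(theta)


def cox_log_partial_likelihood(beta, times, events, covariates):
    """Cox log partial likelihood, assuming no tied event times."""
    scores = [sum(b * x for b, x in zip(beta, row)) for row in covariates]
    total = 0.0
    for i, (t_i, d_i) in enumerate(zip(times, events)):
        if not d_i:
            continue  # censored observations contribute only through risk sets
        # Risk set: subjects still under observation at time t_i.
        risk = sum(math.exp(scores[j]) for j, t_j in enumerate(times) if t_j >= t_i)
        total += scores[i] - math.log(risk)
    return total


def penalized_objective(beta, times, events, covariates, lam, a=3.7):
    """Objective to maximize: log partial likelihood minus n times the SCAD penalty."""
    n = len(times)
    penalty = sum(scad_penalty(b, lam, a) for b in beta)
    return cox_log_partial_likelihood(beta, times, events, covariates) - n * penalty
```

In this sketch the SCAD penalty is constant once a coefficient exceeds `a * lam` in absolute value, so large coefficients are not shrunk further, whereas the $L_1$ penalty keeps growing. That difference is one intuition for why a nonconcave penalty can reduce the bias that the LASSO incurs on large effects.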