Stability Selection
Top Cited Papers
Open Access
- 5 August 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Journal of the Royal Statistical Society Series B: Statistical Methodology
- Vol. 72 (4) , 417-473
- https://doi.org/10.1111/j.1467-9868.2010.00740.x
Abstract
Summary: Estimation of structure, such as in variable selection, graphical modelling or cluster analysis, is notoriously difficult, especially for high dimensional data. We introduce stability selection. It is based on subsampling in combination with (high dimensional) selection algorithms. As such, the method is extremely general and has a very wide range of applicability. Stability selection provides finite sample control for some error rates of false discoveries and hence a transparent principle to choose a proper amount of regularization for structure estimation. Variable selection and structure estimation improve markedly for a range of selection methods if stability selection is applied. We prove for the randomized lasso that stability selection will be variable selection consistent even if the necessary conditions for consistency of the original lasso method are violated. We demonstrate stability selection for variable selection and Gaussian graphical modelling, using real and simulated data.Keywords
This publication has 66 references indexed in Scilit:
- Mapping in Structured Populations by Resample Model AveragingGenetics, 2009
- Sure Independence Screening for Ultrahigh Dimensional Feature SpaceJournal of the Royal Statistical Society Series B: Statistical Methodology, 2008
- Conditional variable importance for random forestsBMC Bioinformatics, 2008
- Comparing the Characteristics of Gene Expression Profiles Derived by Univariate and Multivariate Classification MethodsStatistical Applications in Genetics and Molecular Biology, 2008
- Sparse inverse covariance estimation with the graphical lassoBiostatistics, 2007
- On the “degrees of freedom” of the lassoThe Annals of Statistics, 2007
- Sparsity oracle inequalities for the LassoElectronic Journal of Statistics, 2007
- Multiple Hypothesis Testing in Microarray ExperimentsStatistical Science, 2003
- Variable selection in qualitative models via an entropic explanatory powerJournal of Statistical Planning and Inference, 2003
- A Remark on the Difference between Sampling with and without ReplacementJournal of the American Statistical Association, 1977