Asymptotic properties of bridge estimators in sparse high-dimensional regression models
Top Cited Papers
Open Access
- 1 April 2008
- journal article
- Published by Institute of Mathematical Statistics in The Annals of Statistics
- Vol. 36 (2)
- https://doi.org/10.1214/009053607000000875
Abstract
We study the asymptotic properties of bridge estimators in sparse, high-dimensional, linear regression models when the number of covariates may increase to infinity with the sample size. We are particularly interested in the use of bridge estimators to distinguish between covariates whose coefficients are zero and covariates whose coefficients are nonzero. We show that under appropriate conditions, bridge estimators correctly select covariates with nonzero coefficients with probability converging to one and that the estimators of nonzero coefficients have the same asymptotic distribution that they would have if the zero coefficients were known in advance. Thus, bridge estimators have an oracle property in the sense of Fan and Li [J. Amer. Statist. Assoc. 96 (2001) 1348--1360] and Fan and Peng [Ann. Statist. 32 (2004) 928--961]. In general, the oracle property holds only if the number of covariates is smaller than the sample size. However, under a partial orthogonality condition in which the covariates of the zero coefficients are uncorrelated or weakly correlated with the covariates of nonzero coefficients, we show that marginal bridge estimators can correctly distinguish between covariates with nonzero and zero coefficients with probability converging to one even when the number of covariates is greater than the sample size.Comment: Published in at http://dx.doi.org/10.1214/009053607000000875 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.orgKeywords
All Related Versions
This publication has 17 references indexed in Scilit:
- Marginal asymptotics for the “large $p$, small $n$” paradigm: With applications to microarray dataThe Annals of Statistics, 2007
- Boosting for high-dimensional linear modelsThe Annals of Statistics, 2006
- Prediction by Supervised Principal ComponentsJournal of the American Statistical Association, 2006
- A Two-Way Semilinear Model for Normalization and Analysis of cDNA Microarray DataJournal of the American Statistical Association, 2005
- Semilinear High-Dimensional Model for Normalization of Microarray DataJournal of the American Statistical Association, 2005
- Regularization and Variable Selection Via the Elastic NetJournal of the Royal Statistical Society Series B: Statistical Methodology, 2005
- Nonconcave penalized likelihood with a diverging number of parametersThe Annals of Statistics, 2004
- Gene expression analysis with the parametric bootstrapBiostatistics, 2001
- A Statistical View of Some Chemometrics Regression ToolsTechnometrics, 1993
- Ridge Regression: Biased Estimation for Nonorthogonal ProblemsTechnometrics, 1970