Sharp Thresholds for High-Dimensional and Noisy Sparsity Recovery Using $\ell _{1}$-Constrained Quadratic Programming (Lasso)
- 21 April 2009
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Information Theory
- Vol. 55 (5), 2183-2202
- https://doi.org/10.1109/tit.2009.2016018
Abstract
The problem of consistently estimating the sparsity pattern of a vector $\beta^* \in \mathbb{R}^p$ based on observations contaminated by noise arises in various contexts, including signal denoising, sparse approximation, compressed sensing, and model selection. We analyze the behavior of $\ell_1$-constrained quadratic programming (QP), also referred to as the Lasso, for recovering the sparsity pattern. Our main result is to establish precise conditions on the problem dimension $p$, the number $k$ of nonzero elements in $\beta^*$, and the number of observations $n$ that are necessary and sufficient for sparsity pattern recovery using the Lasso. We first analyze the case of observations made using deterministic design matrices and sub-Gaussian additive noise, and provide sufficient conditions for support recovery and $\ell_\infty$-error bounds, as well as results showing the necessity of incoherence and bounds on the minimum value. We then turn to the case of random designs, in which each row of the design is drawn from a $N(0, \Sigma)$ ensemble. For a broad class of Gaussian ensembles satisfying mutual incoherence conditions, we compute explicit values of thresholds $0 < \theta_{\ell}(\Sigma) \le \theta_u(\Sigma) < +\infty$ with the following properties: for any $\delta > 0$, if $n > 2(\theta_u + \delta)\, k \log(p - k)$, then the Lasso succeeds in recovering the sparsity pattern with probability converging to one for large problems, whereas for $n < 2(\theta_{\ell} - \delta)\, k \log(p - k)$, the probability of successful recovery converges to zero. For the special case of the uniform Gaussian ensemble ($\Sigma = I_{p \times p}$), we show that $\theta_{\ell} = \theta_u = 1$, so that the precise threshold $n = 2k \log(p - k)$ is exactly determined.
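For concreteness, the Lagrangian form of the $\ell_1$-constrained QP analyzed here is $\hat{\beta} \in \arg\min_{\beta} \frac{1}{2n}\|y - X\beta\|_2^2 + \lambda_n \|\beta\|_1$. The sketch below is a minimal empirical illustration of the stated threshold, not code from the paper: it draws a standard Gaussian design ($\Sigma = I_{p \times p}$), sets $n$ a constant factor above $2k\log(p-k)$, and checks exact support recovery using scikit-learn's Lasso. The problem sizes, noise level, and the constants in the sample size and regularization level are illustrative choices.

```python
import numpy as np
from sklearn.linear_model import Lasso

# Illustrative sketch: empirical support recovery for the standard Gaussian
# ensemble (Sigma = I), where the paper's sharp threshold is n = 2k log(p - k).
rng = np.random.default_rng(0)
p, k, sigma = 512, 8, 0.25
n = int(3.0 * k * np.log(p - k))  # sample size above the threshold (factor is arbitrary)

# k-sparse signal beta* with minimum absolute value bounded away from zero.
beta_star = np.zeros(p)
support = rng.choice(p, size=k, replace=False)
beta_star[support] = rng.choice([-1.0, 1.0], size=k)

X = rng.standard_normal((n, p))                 # rows drawn i.i.d. from N(0, I)
y = X @ beta_star + sigma * rng.standard_normal(n)

# Regularization on the order of sigma * sqrt(log(p) / n); the constant 2.0
# is an illustrative choice, not a value prescribed by the paper.
lam = 2.0 * sigma * np.sqrt(np.log(p) / n)
fit = Lasso(alpha=lam, fit_intercept=False, max_iter=50_000).fit(X, y)

recovered = set(np.flatnonzero(np.abs(fit.coef_) > 1e-6))
print("exact support recovery:", recovered == set(support))
```

Sweeping $n$ below and above $2k\log(p-k)$ and averaging the success indicator over independent trials traces out the sharp success/failure transition described in the abstract.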