High-dimensional Ising model selection using ℓ1-regularized logistic regression

Top Cited Papers

Open Access

1 June 2010

journal article
Published by Institute of Mathematical Statistics in The Annals of Statistics

Vol. 38 (3) , 1287-1319
https://doi.org/10.1214/09-aos691

Abstract

We consider the problem of estimating the graph associated with a binary Ising Markov random field. We describe a method based on ℓ₁-regularized logistic regression, in which the neighborhood of any given node is estimated by performing logistic regression subject to an ℓ₁-constraint. The method is analyzed under high-dimensional scaling in which both the number of nodes p and maximum neighborhood size d are allowed to grow as a function of the number of observations n. Our main results provide sufficient conditions on the triple (n, p, d) and the model parameters for the method to succeed in consistently estimating the neighborhood of every node in the graph simultaneously. With coherence conditions imposed on the population Fisher information matrix, we prove that consistent neighborhood selection can be obtained for sample sizes n=Ω(d³log p) with exponentially decaying error. When these same conditions are imposed directly on the sample matrices, we show that a reduced sample size of n=Ω(d²log p) suffices for the method to estimate neighborhoods consistently. Although this paper focuses on the binary graphical models, we indicate how a generalization of the method of the paper would apply to general discrete Markov random fields.

Keywords

All Related Versions

This publication has 19 references indexed in Scilit:

Support union recovery in high-dimensional multivariate regression
The Annals of Statistics, 2011
Sharp Thresholds for High-Dimensional and Noisy Sparsity Recovery Using $\ell _{1}$-Constrained Quadratic Programming (Lasso)
IEEE Transactions on Information Theory, 2009
Sparse permutation invariant covariance estimation
Electronic Journal of Statistics, 2008
The Dantzig selector: Statistical estimation when p is much larger than n
The Annals of Statistics, 2007
Consistent estimation of the basic neighborhood of Markov random fields
The Annals of Statistics, 2006
Model Selection and Estimation in Regression with Grouped Variables
Journal of the Royal Statistical Society Series B: Statistical Methodology, 2005
Maximum likelihood bounded tree-width Markov networks
Artificial Intelligence, 2003
Markov image modeling
IEEE Transactions on Automatic Control, 1978
Approximating discrete probability distributions with dependence trees
IEEE Transactions on Information Theory, 1968
Probability Inequalities for Sums of Bounded Random Variables
Journal of the American Statistical Association, 1963