A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics
Top Cited Papers
- 14 January 2005
- journal article
- Published by Walter de Gruyter GmbH in Statistical Applications in Genetics and Molecular Biology
- Vol. 4 (1) , Article32
- https://doi.org/10.2202/1544-6115.1175
Abstract
Inferring large-scale covariance matrices from sparse genomic data is an ubiquitous problem in bioinformatics. Clearly, the widely used standard covariance and correlation estimators are ill-suited for this purpose. As statistically efficient and computationally fast alternative we propose a novel shrinkage covariance estimator that exploits the Ledoit-Wolf (2003) lemma for analytic calculation of the optimal shrinkage intensity.Subsequently, we apply this improved covariance estimator (which has guaranteed minimum mean squared error, is well-conditioned, and is always positive definite even for small sample sizes) to the problem of inferring large-scale gene association networks. We show that it performs very favorably compared to competing approaches both in simulations as well as in application to real expression data.Keywords
This publication has 15 references indexed in Scilit:
- Estimating genomic coexpression networks using first-order conditional independenceGenome Biology, 2004
- Large-Scale Simultaneous Hypothesis TestingJournal of the American Statistical Association, 2004
- Diagnosis of multiple cancer types by shrunken centroids of gene expressionProceedings of the National Academy of Sciences, 2002
- Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networksProceedings of the National Academy of Sciences, 2000
- Estimation of the Scale Matrix and its Eigenvalues in the Wishart and the Multivariate F DistributionsAnnals of the Institute of Statistical Mathematics, 1998
- Parametric Empirical Bayes Inference: Theory and ApplicationsJournal of the American Statistical Association, 1983
- Stein's Paradox in StatisticsScientific American, 1977
- Data Analysis Using Stein's Estimator and its GeneralizationsJournal of the American Statistical Association, 1975
- Ridge Regression: Applications to Nonorthogonal ProblemsTechnometrics, 1970
- Ridge Regression: Biased Estimation for Nonorthogonal ProblemsTechnometrics, 1970