Aggregation for Gaussian regression

Open Access

1 August 2007

journal article
Published by Institute of Mathematical Statistics in The Annals of Statistics

Vol. 35 (4) , 1674-1697
https://doi.org/10.1214/009053606000001587

Abstract

This paper studies statistical aggregation procedures in the regression setting. A motivating factor is the existence of many different methods of estimation, leading to possibly competing estimators. We consider here three different types of aggregation: model selection (MS) aggregation, convex (C) aggregation and linear (L) aggregation. The objective of (MS) is to select the optimal single estimator from the list; that of (C) is to select the optimal convex combination of the given estimators; and that of (L) is to select the optimal linear combination of the given estimators. We are interested in evaluating the rates of convergence of the excess risks of the estimators obtained by these procedures. Our approach is motivated by recently published minimax results [Nemirovski, A. (2000). Topics in non-parametric statistics. Lectures on Probability Theory and Statistics (Saint-Flour, 1998). Lecture Notes in Math. 1738 85–277. Springer, Berlin; Tsybakov, A. B. (2003). Optimal rates of aggregation. Learning Theory and Kernel Machines. Lecture Notes in Artificial Intelligence 2777 303–313. Springer, Heidelberg]. There exist competing aggregation procedures achieving optimal convergence rates for each of the (MS), (C) and (L) cases separately. Since these procedures are not directly comparable with each other, we suggest an alternative solution. We prove that all three optimal rates, as well as those for the newly introduced (S) aggregation (subset selection), are nearly achieved via a single “universal” aggregation procedure. The procedure consists of mixing the initial estimators with weights obtained by penalized least squares. Two different penalties are considered: one of them is of the BIC type, the second one is a data-dependent ℓ₁-type penalty.

Keywords

All Related Versions

Version 1, 2007-10-19, ArXiv

This publication has 41 references indexed in Scilit:

Aggregated estimators and empirical complexity for least square regression
Annales de l'Institut Henri Poincaré, Probabilités et Statistiques, 2004
Aggregating regression procedures to improve performance
Bernoulli, 2004
Gaussian model selection
Journal of the European Mathematical Society, 2001
Adaptive Regression by Mixing
Journal of the American Statistical Association, 2001
On model selection
Published by Institute of Mathematical Statistics ,2001
On the LASSO and its Dual
Journal of Computational and Graphical Statistics, 2000
Functional aggregation for nonparametric regression
The Annals of Statistics, 2000
Estimating the Dimension of a Model
The Annals of Statistics, 1978
A new look at the statistical model identification
IEEE Transactions on Automatic Control, 1974
Some Comments onC_p
Technometrics, 1973