Component selection and smoothing in multivariate nonparametric regression
Top Cited Papers
Open Access
- 1 October 2006
- journal article
- Published by Institute of Mathematical Statistics in The Annals of Statistics
- Vol. 34 (5) , 2272-2297
- https://doi.org/10.1214/009053606000000722
Abstract
We propose a new method for model selection and model fitting in multivariate nonparametric regression models, in the framework of smoothing spline ANOVA. The “COSSO” is a method of regularization with the penalty functional being the sum of component norms, instead of the squared norm employed in the traditional smoothing spline method. The COSSO provides a unified framework for several recent proposals for model selection in linear models and smoothing spline ANOVA models. Theoretical properties, such as the existence and the rate of convergence of the COSSO estimator, are studied. In the special case of a tensor product design with periodic functions, a detailed analysis reveals that the COSSO does model selection by applying a novel soft thresholding type operation to the function components. We give an equivalent formulation of the COSSO estimator which leads naturally to an iterative algorithm. We compare the COSSO with MARS, a popular method that builds functional ANOVA models, in simulations and real examples. The COSSO method can be extended to classification problems and we compare its performance with those of a number of machine learning algorithms on real datasets. The COSSO gives very competitive performance in these studies.Keywords
All Related Versions
This publication has 19 references indexed in Scilit:
- Inference After Model SelectionJournal of the American Statistical Association, 2004
- Variable Selection and Model Building via Likelihood Basis PursuitJournal of the American Statistical Association, 2004
- Least angle regressionThe Annals of Statistics, 2004
- Bayesian Variable Selection and Model Averaging in High-Dimensional Multinomial Nonparametric RegressionJournal of Computational and Graphical Statistics, 2003
- Adaptive Model SelectionJournal of the American Statistical Association, 2002
- On Measuring and Correcting the Effects of Data Mining and Model SelectionJournal of the American Statistical Association, 1998
- Smoothing spline ANOVA for exponential families, with application to the Wisconsin Epidemiological Study of Diabetic Retinopathy : the 1994 Neyman Memorial LectureThe Annals of Statistics, 1995
- Better Subset Regression Using the Nonnegative GarroteTechnometrics, 1995
- Diagnostics for Nonparametric Regression Models with Additive TermsJournal of the American Statistical Association, 1992
- Multivariate Adaptive Regression SplinesThe Annals of Statistics, 1991