Using Spatial Statistics to Select Model Complexity

1 June 2002

journal article
Published by Taylor & Francis in Journal of Computational and Graphical Statistics

Vol. 11 (2) , 348-369
https://doi.org/10.1198/106186002760180554

Abstract

Testing for nonindependence among the residuals from a regression or time series model is a common approach to evaluating the adequacy of a fitted model. This idea underlies the familiar Durbin–Watson statistic, and previous works illustrate how the spatial autocorrelation among residuals can be used to test a candidate linear model. We propose here that a version of Moran's I statistic for spatial autocorrelation, applied to residuals from a fitted model, is a practical general tool for selecting model complexity under the assumption of iid additive errors. The “space” is defined by the independent variables, and the presence of significant spatial autocorrelation in residuals is evidence that a more complex model is needed to capture all of the structure in the data. An advantage of this approach is its generality, which results from the fact that no properties of the fitted model are used other than consistency. The problem of smoothing parameter selection in nonparametric regression is used to illustrate the performance of model selection based on residual spatial autocorrelation (RSA). In simulation trials comparing RSA with established selection criteria based on minimizing mean square prediction error, smooths selected by RSA exhibit fewer spurious features such as minima and maxima. In some cases, at higher noise levels, RSA smooths achieved a lower average mean square error than smooths selected by GCV. We also briefly describe a possible modification of the method for non-iid errors having short-range correlations, for example, time-series errors or spatial data. Some other potential applications are suggested, including variable selection in regression models.

Keywords

This publication has 9 references indexed in Scilit:

Nonparametric Regressin with Correlated Errors
Statistical Science, 2001
A Comparison of Regression Spline Smoothing Procedures
Computational Statistics, 2000
Theory & Methods: Spatially‐adaptive Penalties for Spline Fitting
Australian & New Zealand Journal of Statistics, 2000
When is a feature really there: the SiZer approach
Published by SPIE-Intl Soc Optical Eng ,1998
Smoothing Spline Models with Correlated Random Errors
Journal of the American Statistical Association, 1998
Flexible smoothing with B-splines and penalties
Statistical Science, 1996
An Effective Bandwidth Selector for Local Least Squares Regression
Journal of the American Statistical Association, 1995
Visual Error Criteria for Qualitative Smoothing
Journal of the American Statistical Association, 1995
Permutation Tests for Correlation in Regression Errors
Journal of the American Statistical Association, 1994