FORWARD SELECTION OF EXPLANATORY VARIABLES
Open Access
- 1 September 2008
- Vol. 89 (9), 2623-2632
- https://doi.org/10.1890/07-0986.1
Abstract
This paper proposes a new way of using forward selection of explanatory variables in regression or canonical redundancy analysis. The classical forward selection method presents two problems: a highly inflated Type I error and an overestimation of the amount of explained variance. Correcting these problems will greatly improve the performance of this very useful method in ecological modeling. To prevent the first problem, we propose a two-step procedure. First, a global test using all explanatory variables is carried out. If, and only if, the global test is significant, one can proceed with forward selection. To prevent overestimation of the explained variance, the forward selection has to be carried out with two stopping criteria: (1) the usual alpha significance level and (2) the adjusted coefficient of multiple determination ($R^2_a$) calculated using all explanatory variables. When forward selection identifies a variable that brings one or the other criterion over the fixed threshold, that variable is rejected, and the procedure is stopped. This improved method is validated by simulations involving univariate and multivariate response data. An ecological example is presented using data from the Bryce Canyon National Park, Utah, USA.
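The two-step procedure described in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it handles a univariate response only, the function names (`forward_select`, `perm_pvalue`, and so on) are hypothetical, and the p-values are assumed to come from a permutation test on the residuals of the reduced model, a choice the abstract does not specify.

```python
import numpy as np

def fitted(X, y):
    """Fitted values of an OLS regression of y on X (intercept added)."""
    A = np.column_stack([np.ones(len(y)), X])
    return A @ np.linalg.lstsq(A, y, rcond=None)[0]

def r2(X, y):
    """Coefficient of multiple determination of y on X."""
    resid = y - fitted(X, y)
    centered = y - y.mean()
    return 1.0 - (resid @ resid) / (centered @ centered)

def adj_r2(r2_value, n, m):
    """Adjusted R^2 for n observations and m explanatory variables."""
    return 1.0 - (n - 1) / (n - m - 1) * (1.0 - r2_value)

def f_stat(X_full, X_red, y):
    """F statistic for the variables in X_full that are not in X_red."""
    n, m = X_full.shape
    q = m - X_red.shape[1]
    r2_full, r2_red = r2(X_full, y), r2(X_red, y)
    return ((r2_full - r2_red) / q) / ((1.0 - r2_full) / (n - m - 1))

def perm_pvalue(X_full, X_red, y, rng, n_perm=199):
    """Permutation p-value (assumption: residuals of the reduced model
    are permuted; with an empty reduced model this permutes y itself)."""
    observed = f_stat(X_full, X_red, y)
    base = fitted(X_red, y)
    resid = y - base
    hits = sum(
        f_stat(X_full, X_red, base + rng.permutation(resid)) >= observed
        for _ in range(n_perm)
    )
    return (hits + 1) / (n_perm + 1)

def forward_select(X, y, alpha=0.05, n_perm=199, seed=0):
    """Return the column indices retained by the two-step procedure."""
    rng = np.random.default_rng(seed)
    n, m = X.shape
    empty = X[:, :0]

    # Step 1: global test with all explanatory variables.
    if perm_pvalue(X, empty, y, rng, n_perm) > alpha:
        return []

    # Step 2: forward selection with two stopping criteria.
    r2a_global = adj_r2(r2(X, y), n, m)
    selected, remaining = [], list(range(m))
    while remaining:
        # Candidate: the variable that raises R^2 the most.
        best = max(remaining, key=lambda j: r2(X[:, selected + [j]], y))
        trial = selected + [best]
        p = perm_pvalue(X[:, trial], X[:, selected], y, rng, n_perm)
        r2a = adj_r2(r2(X[:, trial], y), n, len(trial))
        # Reject the candidate and stop if either threshold is exceeded.
        if p > alpha or r2a > r2a_global:
            break
        selected.append(best)
        remaining.remove(best)
    return selected

# Simulated data (illustrative only): columns 0 and 3 carry signal.
demo_rng = np.random.default_rng(42)
X = demo_rng.normal(size=(100, 6))
y = 3.0 * X[:, 0] + X[:, 3] + demo_rng.normal(scale=0.5, size=100)
selected = forward_select(X, y)
print("selected columns:", selected)
```

Note that the second criterion is deliberately conservative: a candidate whose inclusion pushes the adjusted R² of the selected model above that of the global model is rejected even when its own test is significant, which is the stopping rule the abstract states.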