Improvements on Cross-Validation: The 632+ Bootstrap Method
- 1 June 1997
- journal article
- research article
- Published by Taylor & Francis in Journal of the American Statistical Association
- Vol. 92 (438) , 548-560
- https://doi.org/10.1080/01621459.1997.10474007
Abstract
A training set of data has been used to construct a rule for predicting future responses. What is the error rate of this rule? This is an important question both for comparing models and for assessing a final selected model. The traditional answer to this question is given by cross-validation. The cross-validation estimate of prediction error is nearly unbiased but can be highly variable. Here we discuss bootstrap estimates of prediction error, which can be thought of as smoothed versions of cross-validation. We show that a particular bootstrap method, the .632+ rule, substantially outperforms cross-validation in a catalog of 24 simulation experiments. Besides providing point estimates, we also consider estimating the variability of an error rate estimate. All of the results here are nonparametric and apply to any possible prediction rule; however, we study only classification problems with 0–1 loss in detail. Our simulations include “smooth” prediction rules like Fisher's linear discriminant function and unsmooth ones like nearest neighbors.Keywords
This publication has 13 references indexed in Scilit:
- An Introduction to the BootstrapPublished by Springer Nature ,1993
- Submodel Selection and Evaluation in Regression. The X-Random CaseInternational Statistical Review, 1992
- Efficient Bootstrap SimulationBiometrika, 1986
- How Biased is the Apparent Error Rate of a Prediction Rule?Journal of the American Statistical Association, 1986
- Correction note to ‘application of bootstrap and other resampling techniques: Evaluation of classifier performance’Pattern Recognition Letters, 1986
- Application of bootstrap and other resampling techniques: Evaluation of classifier performancePattern Recognition Letters, 1985
- Estimating the Error Rate of a Prediction Rule: Improvement on Cross-ValidationJournal of the American Statistical Association, 1983
- Bootstrap Methods: Another Look at the JackknifeThe Annals of Statistics, 1979
- The Predictive Sample Reuse Method with ApplicationsJournal of the American Statistical Association, 1975
- The Relationship Between Variable Selection and Data Agumentation and a Method for PredictionTechnometrics, 1974