Stability of multivariable fractional polynomial models with selection of variables and transformations: a bootstrap investigation
- 28 January 2003
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 22 (4) , 639-659
- https://doi.org/10.1002/sim.1310
Abstract
Sauerbrei and Royston have recently described an algorithm, based on fractional polynomials, for the simultaneous selection of variables and of suitable transformations for continuous predictors in a multivariable regression setting. They illustrated the approach by analyses of two breast cancer data sets. Here we extend their work by considering how to assess possible instability in such multivariable fractional polynomial models. We first apply the algorithm repeatedly in many bootstrap replicates. We then use log‐linear models to investigate dependencies among the inclusion fractions for each predictor and among the simplified classes of fractional polynomial function chosen in the bootstrap samples. To further evaluate the results, we define measures of instability based on a decomposition of the variability of the bootstrap‐selected functions in relation to a reference function from the original model. For each data set we are able to identify large, reasonably stable subsets of the bootstrap replications in which the functional forms of the predictors appear fairly stable. Despite the considerable flexibility of the family of fractional polynomials and the consequent risk of overfitting when several variables are considered, we conclude that the multivariable selection algorithm can find stable models. Copyright © 2003 John Wiley & Sons, Ltd.Keywords
This publication has 23 references indexed in Scilit:
- Corrigendum: Building Multivariable Prognostic and Diagnostic Models: Transformation of the Predictors by Using Fractional PolynomialsJournal of the Royal Statistical Society Series A: Statistics in Society, 2002
- Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authorsStatistical Science, 1999
- The Use of Resampling Methods to Simplify Regression Models in Medical StatisticsJournal of the Royal Statistical Society Series C: Applied Statistics, 1999
- Building Multivariable Prognostic and Diagnostic Models: Transformation of the Predictors by Using Fractional PolynomialsJournal of the Royal Statistical Society Series A: Statistics in Society, 1999
- Model Selection: An Integral Part of InferencePublished by JSTOR ,1997
- Dangers of Using "Optimal" Cutpoints in the Evaluation of Prognostic FactorsJNCI Journal of the National Cancer Institute, 1994
- Regression Using Fractional Polynomials of Continuous Covariates: Parsimonious Parametric ModellingJournal of the Royal Statistical Society Series C: Applied Statistics, 1994
- Displaying the Important Features of Large Collections of Similar CurvesThe American Statistician, 1992
- A bootstrap resampling procedure for model building: Application to the cox regression modelStatistics in Medicine, 1992
- The bootstrap and identification of prognostic factors via cox's proportional hazards regression modelStatistics in Medicine, 1985