Multiple additive regression trees with application in epidemiology
Top Cited Papers
- 14 April 2003
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 22 (9) , 1365-1381
- https://doi.org/10.1002/sim.1501
Abstract
Predicting future outcomes based on knowledge obtained from past observational data is a common application in a wide variety of areas of scientific research. In the present paper, prediction will be focused on various grades of cervical preneoplasia and neoplasia. Statistical tools used for prediction should of course possess predictive accuracy, and preferably meet secondary requirements such as speed, ease of use, and interpretability of the resulting predictive model. A new automated procedure based on an extension (called ‘boosting’) of regression and classification tree (CART) models is described. The resulting tool is a fast ‘off‐the‐shelf’ procedure for classification and regression that is competitive in accuracy with more customized approaches, while being fairly automatic to use (little tuning), and highly robust especially when applied to less than clean data. Additional tools are presented for interpreting and visualizing the results of such multiple additive regression tree (MART) models. Copyright © 2003 John Wiley & Sons, Ltd.Keywords
This publication has 2 references indexed in Scilit:
- Greedy function approximation: A gradient boosting machine.The Annals of Statistics, 2001
- Additive logistic regression: a statistical view of boosting (With discussion and a rejoinder by the authors)The Annals of Statistics, 2000