BOOSTED TREES FOR ECOLOGICAL MODELING AND PREDICTION
- 1 January 2007
- Vol. 88 (1), 243-251
- https://doi.org/10.1890/0012-9658(2007)88[243:btfema]2.0.co;2
Abstract
Accurate prediction and explanation are fundamental objectives of statistical analysis, yet they seldom coincide. Boosted trees are a statistical learning method that attains both of these objectives for regression and classification analyses. They can deal with many types of response variables (numeric, categorical, and censored), loss functions (Gaussian, binomial, Poisson, and robust), and predictors (numeric, categorical). Interactions between predictors can also be quantified and visualized. The theory underpinning boosted trees is presented, together with interpretive techniques. A new form of boosted trees, namely, “aggregated boosted trees” (ABT), is proposed and, in a simulation study, is shown to reduce prediction error relative to boosted trees. A regression data set is analyzed using ABT to illustrate the technique and to compare it with other methods, including boosted trees, bagged trees, random forests, and generalized additive models. A software package for ABT analysis using the R software environment is included in the Appendices together with worked examples.
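As a rough illustration of the ideas named in the abstract, the sketch below boosts one-split regression trees (stumps) under squared-error (Gaussian) loss and then averages several boosted models, each fitted with one cross-validation fold held out. It is written in Python rather than the R package the paper provides. All function names, the toy data, the tree count, and the shrinkage value are assumptions made for illustration. The fold-averaging step is only one plausible reading of "aggregated", and the paper's ABT procedure may differ in its details.

```python
import math
import random


def fit_stump(xs, residuals):
    """Return (cut, left_mean, right_mean) for the single split on x that
    minimises the squared error of the residuals.
    Assumes xs contains at least two distinct values."""
    values = sorted(set(xs))
    best = None
    for lo, hi in zip(values, values[1:]):
        cut = (lo + hi) / 2.0
        left = [r for x, r in zip(xs, residuals) if x <= cut]
        right = [r for x, r in zip(xs, residuals) if x > cut]
        lmean = sum(left) / len(left)
        rmean = sum(right) / len(right)
        sse = (sum((r - lmean) ** 2 for r in left)
               + sum((r - rmean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, cut, lmean, rmean)
    return best[1], best[2], best[3]


def boost(xs, ys, n_trees=100, shrinkage=0.1):
    """Boosting for squared-error loss: each stump is fitted to the current
    residuals and added to the model with a small step (shrinkage)."""
    base = sum(ys) / len(ys)
    preds = [base] * len(ys)
    stumps = []
    for _ in range(n_trees):
        residuals = [y - p for y, p in zip(ys, preds)]
        cut, lmean, rmean = fit_stump(xs, residuals)
        stumps.append((cut, lmean, rmean))
        preds = [p + shrinkage * (lmean if x <= cut else rmean)
                 for x, p in zip(xs, preds)]
    return base, shrinkage, stumps


def predict(model, x):
    """Prediction of one boosted model at a single x value."""
    base, shrinkage, stumps = model
    return base + shrinkage * sum(l if x <= c else r for c, l, r in stumps)


def aggregated_boost(xs, ys, n_folds=5, **boost_args):
    """Hypothetical aggregation step: fit one boosted model per
    cross-validation training subset (one fold held out each time)."""
    idx = list(range(len(xs)))
    random.Random(0).shuffle(idx)
    models = []
    for k in range(n_folds):
        train = [i for j, i in enumerate(idx) if j % n_folds != k]
        models.append(boost([xs[i] for i in train],
                            [ys[i] for i in train], **boost_args))
    return models


def predict_aggregated(models, x):
    """Average the predictions of the individual boosted models."""
    return sum(predict(m, x) for m in models) / len(models)


def mean_squared_error(predict_fn, xs, ys):
    return sum((y - predict_fn(x)) ** 2 for x, y in zip(xs, ys)) / len(ys)


# Toy data (an assumption): a noisy sine curve on [0, 1).
rng = random.Random(1)
xs = [i / 50 for i in range(50)]
ys = [math.sin(2 * math.pi * x) + rng.gauss(0, 0.1) for x in xs]

model = boost(xs, ys, n_trees=100, shrinkage=0.1)
models = aggregated_boost(xs, ys, n_folds=5, n_trees=100, shrinkage=0.1)
```

Comparing `mean_squared_error(lambda x: predict(model, x), xs, ys)` with the same quantity for `predict_aggregated` gives only a training-set comparison on toy data. It does not reproduce the paper's simulation study, which concerns prediction error.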