The choice of variables in multivariate regression: a non-conjugate Bayesian decision theory approach

1 September 1999

journal article
research article
Published by Oxford University Press (OUP) in Biometrika

Vol. 86 (3), 635-648
https://doi.org/10.1093/biomet/86.3.635

Abstract

We consider the choice of explanatory variables in multivariate linear regression. Our approach balances prediction accuracy against costs attached to variables in a multivariate version of a decision theory approach pioneered by Lindley (1968). We also employ a non-conjugate proper prior distribution for the parameters of the regression model, extending the standard normal-inverse Wishart by adding a component of error which is unexplainable by any number of predictor variables, thus avoiding the determinism identified by Dawid (1988). Simulated annealing and fast updating algorithms are used to search for good subsets when there are very many regressors. The technique is illustrated on a near infrared spectroscopy example involving 39 observations and 300 explanatory variables. This demonstrates the effectiveness of multivariate regression as opposed to separate univariate regressions. It also emphasises that within a Bayesian framework more variables than observations can be utilised.
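The subset search mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the paper's criterion comes from a Bayesian predictive analysis under a non-conjugate prior, and its fast updating formulas are not reproduced here. As a stand-in, the sketch below assumes a simple cost equal to the residual sum of squares of a multivariate least-squares fit plus a fixed charge per selected variable, and refits from scratch at every step. The names `subset_cost`, `anneal_subsets`, `var_cost`, `t0` and `cooling` are hypothetical. Note that plain least squares, unlike the paper's proper prior, gives no principled way of using more variables than observations.

```python
import numpy as np


def subset_cost(X, Y, subset, var_cost):
    """Stand-in cost: multivariate least-squares RSS plus a charge per variable.

    X is (n, p), Y is (n, q), subset is a boolean mask of length p.
    """
    n = X.shape[0]
    idx = np.flatnonzero(subset)
    # Design matrix: intercept plus the selected columns (intercept only if none).
    A = np.column_stack([np.ones(n), X[:, idx]])
    coef, *_ = np.linalg.lstsq(A, Y, rcond=None)
    resid = Y - A @ coef
    return float((resid ** 2).sum()) + var_cost * idx.size


def anneal_subsets(X, Y, var_cost=1.0, n_iter=2000, t0=1.0, cooling=0.995, seed=0):
    """Simulated annealing over variable subsets using single-variable flips."""
    rng = np.random.default_rng(seed)
    p = X.shape[1]
    current = np.zeros(p, dtype=bool)          # start from the empty subset
    cur_cost = subset_cost(X, Y, current, var_cost)
    best, best_cost = current.copy(), cur_cost
    temp = t0
    for _ in range(n_iter):
        # Propose adding or deleting one randomly chosen variable.
        j = rng.integers(p)
        proposal = current.copy()
        proposal[j] = not proposal[j]
        new_cost = subset_cost(X, Y, proposal, var_cost)
        # Always accept improvements; accept worse moves with Metropolis probability.
        if new_cost <= cur_cost or rng.random() < np.exp((cur_cost - new_cost) / temp):
            current, cur_cost = proposal, new_cost
            if cur_cost < best_cost:
                best, best_cost = current.copy(), cur_cost
        temp *= cooling                        # geometric cooling schedule
    return best, best_cost
```

Calling `anneal_subsets(X, Y, var_cost=1.0)` returns a boolean mask of the selected variables together with the cost of that subset. Fitting all responses in `Y` jointly means one subset is chosen for the whole response vector, which is the multivariate setting the abstract contrasts with separate univariate regressions.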
