The choice of variables in multivariate regression: a non-conjugate Bayesian decision theory approach
- 1 September 1999
- journal article
- research article
- Published by Oxford University Press (OUP) in Biometrika
- Vol. 86 (3) , 635-648
- https://doi.org/10.1093/biomet/86.3.635
Abstract
We consider the choice of explanatory variables in multivariate linear regression. Our approach balances prediction accuracy against costs attached to variables in a multivariate version of a decision theory approach pioneered by Lindley (1968). We also employ a non-conjugate proper prior distribution for the parameters of the regression model, extending the standard normal-inverse Wishart by adding a component of error which is unexplainable by any number of predictor variables, thus avoiding the determinism identified by Dawid (1988). Simulated annealing and fast updating algorithms are used to search for good subsets when there are very many regressors. The technique is illustrated on a near infrared spectroscopy example involving 39 observations and 300 explanatory variables. This demonstrates the effectiveness of multivariate regression as opposed to separate univariate regressions. It also emphasises that within a Bayesian framework more variables than observations can be utilised.
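The subset search mentioned in the abstract can be illustrated with a minimal simulated annealing sketch. This is not the authors' algorithm: the abstract gives no details of their annealing schedule, their fast updating formulas, or their Bayesian expected loss. The names `anneal_subset`, `toy_objective`, `gains` and `cost`, the single-variable flip proposal and the geometric cooling schedule are all assumptions made for illustration, and the toy objective is only a stand-in for "prediction loss plus a cost per included variable".

```python
import math
import random


def anneal_subset(objective, n_vars, n_iter=5000, t0=1.0, cooling=0.999, seed=0):
    """Search over inclusion vectors for a low value of `objective`.

    Hypothetical sketch: flips one variable in or out per step and uses a
    geometric cooling schedule. Returns the best subset visited and its value.
    """
    rng = random.Random(seed)
    current = [False] * n_vars          # start from the empty subset
    cur_val = objective(current)
    best, best_val = current[:], cur_val
    temp = t0
    for _ in range(n_iter):
        j = rng.randrange(n_vars)
        current[j] = not current[j]     # propose adding or dropping variable j
        new_val = objective(current)
        delta = new_val - cur_val
        # Always accept improvements; accept worse moves with Metropolis probability.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            cur_val = new_val
            if cur_val < best_val:
                best, best_val = current[:], cur_val
        else:
            current[j] = not current[j]  # rejected: undo the flip
        temp *= cooling
    return best, best_val


# Toy stand-in objective (an assumption, not the paper's expected loss):
# each excluded variable leaves `gains[i]` of loss unexplained,
# and each included variable incurs a fixed cost.
gains = [5.0, 3.0, 0.1, 0.0, 2.0, 0.0, 0.2, 1.5]
cost = 0.5


def toy_objective(include):
    unexplained = sum(g for g, inc in zip(gains, include) if not inc)
    return unexplained + cost * sum(include)


best, best_val = anneal_subset(toy_objective, len(gains))
```

Because this toy objective is separable, its optimum is simply the set of variables whose gain exceeds the cost, which makes it easy to check the search. In a real application the objective would have to be recomputed, or updated incrementally, from the regression model for each candidate subset.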