Parameter priors for directed acyclic graphical models and the characterization of several probability distributions
Open Access
- 1 October 2002
- journal article
- Published by Institute of Mathematical Statistics in The Annals of Statistics
- Vol. 30 (5) , 1412-1440
- https://doi.org/10.1214/aos/1035844981
Abstract
We develop simple methods for constructing parameter priors for model choice among directed acyclic graphical (DAG) models. In particular, we introduce several assumptions that permit the construction of parameter priors for a large number of DAG models from a small set of assessments. We then present a method for directly computing the marginal likelihood of every DAG model given a random sample with no missing observations. We apply this methodology to Gaussian DAG models which consist of a recursive set of linear regression models. We show that the only parameter prior for complete Gaussian DAG models that satisfies our assumptions is the normal-Wishart distribution. Our analysis is based on the following new characterization of the Wishart distribution: let $W$ be an $n \times n$, $n \ge 3$, positive definite symmetric matrix of random variables and $f(W)$ be a pdf of $W$. Then, $f(W)$ is a Wishart distribution if and only if $W_{11} - W_{12} W_{22}^{-1} W'_{12}$ is independent of $\{W_{12},W_{22}\}$ for every block partitioning $W_{11},W_{12}, W'_{12}, W_{22}$ of $W$. Similar characterizations of the normal and normal-Wishart distributions are provided as well.
Keywords
All Related Versions
This publication has 19 references indexed in Scilit:
- A characterization of the Dirichlet distribution through global and local parameter independenceThe Annals of Statistics, 1997
- A characterization of Markov equivalence classes for acyclic digraphsThe Annals of Statistics, 1997
- Bayesian model averaging and model selection for markov equivalence classes of acyclic digraphsCommunications in Statistics - Theory and Methods, 1996
- Learning Bayesian networks: The combination of knowledge and statistical dataMachine Learning, 1995
- Real-world applications of Bayesian networksCommunications of the ACM, 1995
- Hyper Markov Laws in the Statistical Analysis of Decomposable Graphical ModelsThe Annals of Statistics, 1993
- Bayesian Analysis in Expert SystemsStatistical Science, 1993
- A Bayesian method for the induction of probabilistic networks from dataMachine Learning, 1992
- Sequential updating of conditional probabilities on directed graphical structuresNetworks, 1990
- Interactive Elicitation of Opinion for a Normal Linear ModelJournal of the American Statistical Association, 1980