Identifying Markov blankets with decision tree induction
- 23 April 2004
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
The Markov blanket of a target variable is the minimum conditioning set of variables that makes the target independent of all other variables. Markov blankets inform feature selection, aid in causal discovery and serve as a basis for scalable methods of constructing Bayesian networks. We apply decision tree induction to the task of Markov blanket identification. Notably, we compare (a) C5.0, a widely used algorithm for decision rule induction, (b) C5C, which post-processes C5.0 's rule set to retain the most frequently referenced variables and (c) PC, a standard method for Bayesian network induction. C5C performs as well as or better than C5.0 and PC across a number of data sets. Our modest variation of an inexpensive, accurate, off-the-shelf induction engine mitigates the need for specialized procedures, and establishes baseline performance against which specialized algorithms can be compared.Keywords
This publication has 10 references indexed in Scilit:
- Time and sample efficient discovery of Markov blankets and direct causal relationsPublished by Association for Computing Machinery (ACM) ,2003
- Learning Bayesian networks from data: An information-theory based approachArtificial Intelligence, 2002
- The use of a Bayesian network in the design of a decision support system for growing malting barley without use of pesticidesComputers and Electronics in Agriculture, 2002
- Wrappers for feature subset selectionArtificial Intelligence, 1997
- The hardwiring of development: organization and function of genomic regulatory systemsDevelopment, 1997
- Adaptive Probabilistic Networks with Hidden VariablesMachine Learning, 1997
- Hailfinder: A Bayesian system for forecasting severe weatherInternational Journal of Forecasting, 1996
- Greedy Attribute SelectionPublished by Elsevier ,1994
- Using Decision Trees to Improve Case-Based LearningPublished by Elsevier ,1993
- Induction of decision treesMachine Learning, 1986