Identifying Markov blankets with decision tree induction

23 April 2004

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 59-66
https://doi.org/10.1109/icdm.2003.1250903

Abstract

The Markov blanket of a target variable is the minimum conditioning set of variables that makes the target independent of all other variables. Markov blankets inform feature selection, aid in causal discovery and serve as a basis for scalable methods of constructing Bayesian networks. We apply decision tree induction to the task of Markov blanket identification. Notably, we compare (a) C5.0, a widely used algorithm for decision rule induction, (b) C5C, which post-processes C5.0 's rule set to retain the most frequently referenced variables and (c) PC, a standard method for Bayesian network induction. C5C performs as well as or better than C5.0 and PC across a number of data sets. Our modest variation of an inexpensive, accurate, off-the-shelf induction engine mitigates the need for specialized procedures, and establishes baseline performance against which specialized algorithms can be compared.

Keywords

This publication has 10 references indexed in Scilit:

Time and sample efficient discovery of Markov blankets and direct causal relations
Published by Association for Computing Machinery (ACM) ,2003
Learning Bayesian networks from data: An information-theory based approach
Artificial Intelligence, 2002
The use of a Bayesian network in the design of a decision support system for growing malting barley without use of pesticides
Computers and Electronics in Agriculture, 2002
Wrappers for feature subset selection
Artificial Intelligence, 1997
The hardwiring of development: organization and function of genomic regulatory systems
Development, 1997
Adaptive Probabilistic Networks with Hidden Variables
Machine Learning, 1997
Hailfinder: A Bayesian system for forecasting severe weather
International Journal of Forecasting, 1996
Greedy Attribute Selection
Published by Elsevier ,1994
Using Decision Trees to Improve Case-Based Learning
Published by Elsevier ,1993
Induction of decision trees
Machine Learning, 1986