An information theoretic approach to rule induction from databases
- 1 August 1992
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Knowledge and Data Engineering
- Vol. 4 (4) , 301-316
- https://doi.org/10.1109/69.149926
Abstract
An algorithm for the induction of rules from examples is introduced. The algorithm is novel in the sense that it not only learns rules for a given concept (classification), but it simultaneously learns rules relating multiple concepts. This type of learning, known as generalized rule induction, is considerably more general than existing algorithms, which tend to be classification oriented. Initially, it is focused on the problem of determining a quantitative, well-defined rule preference measure. In particular, a quantity called the J-measure is proposed as an information-theoretic alternative to existing approaches. The J-measure quantifies the information content of a rule or a hypothesis. The information theoretic origins of this measure are outlined, and its plausibility as a hypothesis preference measure is examined. The ITRULE algorithm, which uses the measure to learn a set of optimal rules from a set of data samples, is defined. Experimental results on real-world data are analyzed.<>Keywords
This publication has 22 references indexed in Scilit:
- Learning decision listsMachine Learning, 1987
- Bias, Version Spaces and Valiant's Learning FrameworkPublished by Elsevier ,1987
- A theory of the learnableCommunications of the ACM, 1984
- Inductive Inference: Theory and MethodsACM Computing Surveys, 1983
- Pattern Recognition as Rule-Guided Inductive InferencePublished by Institute of Electrical and Electronics Engineers (IEEE) ,1980
- Axiomatic derivation of the principle of maximum entropy and the principle of minimum cross-entropyIEEE Transactions on Information Theory, 1980
- Behaviour/structure transformations under uncertaintyInternational Journal of Man-Machine Studies, 1976
- The amount of information that y gives about XIEEE Transactions on Information Theory, 1968
- Language identification in the limitInformation and Control, 1967
- A Mathematical Theory of CommunicationBell System Technical Journal, 1948