Abstract
Information theory offers a coherent, intuitive view of model selection. This perspective arises from thinking of a statistical model as a code: an algorithm for compressing data into a sequence of bits. The description length is the length of this code for the data plus the length of a description of the model itself. The length of the code for the data measures the fit of the model to the data, whereas the length of the code for the model measures its complexity. The minimum description length (MDL) principle picks the model with the smallest description length, balancing fit against complexity. Variations on MDL reproduce other well-known methods of model selection. Going further, information theory allows one to choose among different types of models, permitting, for example, the comparison of tree-based models with regressions. A running example compares several models for the well-known Boston housing data.
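As a minimal sketch of the two-part code idea (an illustration, not the paper's own coding scheme), one can approximate the data part of the description length for a Gaussian linear model by (n/2) log(RSS/n) and the model part by (k/2) log n for k fitted coefficients; under this classical approximation, MDL coincides with BIC up to a constant factor. The function and synthetic data below are hypothetical.

```python
import numpy as np

def description_length(y, X):
    """Approximate two-part code length (in nats) for a Gaussian linear model.

    Data part:  (n/2) * log(RSS / n)  -- cost of encoding the residuals
    Model part: (k/2) * log(n)        -- cost of describing k coefficients

    This crude approximation makes MDL agree with BIC up to a factor of 2;
    it illustrates the fit-versus-complexity trade-off, nothing more.
    """
    n, k = X.shape
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    rss = np.sum((y - X @ beta) ** 2)
    return 0.5 * n * np.log(rss / n) + 0.5 * k * np.log(n)

# Toy comparison: does a quadratic term pay for its extra description cost?
rng = np.random.default_rng(0)
x = rng.uniform(-2, 2, size=200)
y = 1.0 + 2.0 * x + rng.normal(scale=0.5, size=200)  # truth is linear

X_lin = np.column_stack([np.ones_like(x), x])
X_quad = np.column_stack([np.ones_like(x), x, x ** 2])

for name, X in [("linear", X_lin), ("quadratic", X_quad)]:
    print(name, description_length(y, X))
# The spurious quadratic term typically shortens the data code by less
# than the extra (1/2) log n needed to describe it, so MDL tends to
# favor the (true) linear model.
```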