A Comparison of Segment Retention Criteria for Finite Mixture Logit Models
- 1 May 2003
- journal article
- research article
- Published by SAGE Publications in Journal of Marketing Research
- Vol. 40 (2) , 235-243
- https://doi.org/10.1509/jmkr.40.2.235.19225
Abstract
Despite the widespread application of finite mixture models in marketing research, the decision of how many segments to retain in the models is an important unresolved issue. Almost all applications of the models in marketing rely on segment retention criteria such as Akaike's information criterion, Bayesian information criterion, consistent Akaike's information criterion, and information complexity to determine the number of latent segments to retain. Because these applications employ real-world data in which the true number of segments is unknown, it is not clear whether these criteria are effective. Retaining the true number of segments is crucial because many product design and marketing decisions depend on it. The purpose of this extensive simulation study is to determine how well commonly used segment retention criteria perform in the context of simulated multinomial choice data, as obtained from supermarket scanner panels, in which the true number of segments is known. The authors find that an Akaike's information criterion with a penalty factor of three rather than the traditional value of two has the highest segment retention success rate across nearly all experimental conditions. Currently, this criterion is rarely, if ever, applied in the marketing literature. Experimental factors of particular interest in marketing contexts, such as the number of choices per household, the number of choice alternatives, the error variance of the choices, and the minimum segment size, have not been considered in the statistics literature. The authors show that they, among other factors, affect the performance of segment retention criteria.Keywords
This publication has 35 references indexed in Scilit:
- Identifying segments with identical choice behaviors across product categories: An Intercategory Logit Mixture modelInternational Journal of Research in Marketing, 2002
- An entropy criterion for assessing the number of clusters in a mixture modelJournal of Classification, 1996
- Issues in the estimation and application of latent structure models of choiceMarketing Letters, 1994
- Measuring brand value with scanner dataInternational Journal of Research in Marketing, 1993
- Information Ratios for Validating Mixture AnalysesJournal of the American Statistical Association, 1992
- Latent class metric conjoint analysisMarketing Letters, 1992
- On the information-based measure of covariance complexity and its application to the evaluation of multivariate linear modelsCommunications in Statistics - Theory and Methods, 1990
- Statistical Modelling of Data on Teaching StylesJournal of the Royal Statistical Society. Series A (General), 1981
- Estimating the Dimension of a ModelThe Annals of Statistics, 1978
- On the Distribution of the Log Likelihood Ratio Test Statistic When the True Parameter is "Near" the Boundaries of the Hypothesis RegionsThe Annals of Mathematical Statistics, 1968