Imputation of Missing Categorical Data by Maximizing Internal Consistency
- 1 December 1992
- journal article
- Published by Cambridge University Press (CUP) in Psychometrika
- Vol. 57 (4) , 567-580
- https://doi.org/10.1007/bf02294420
Abstract
This paper suggests a method to supplant missing categorical data by “reasonable” replacements. These replacements will maximize the consistency of the completed data as measured by Guttman's squared correlation ratio. The text outlines a solution of the optimization problem, describes relationships with the relevant psychometric theory, and studies some properties of the method in detail. The main result is that the average correlation should be at least 0.50 before the method becomes practical. At that point, the technique gives reasonable results up to 10–15% missing data.Keywords
This publication has 10 references indexed in Scilit:
- EM and BeyondPsychometrika, 1991
- Clustering N Objects into K Groups under Optimal Scaling of VariablesPsychometrika, 1989
- Multiple Imputation for Nonresponse in SurveysPublished by Wiley ,1987
- The Calculation of Posterior Distributions by Data AugmentationJournal of the American Statistical Association, 1987
- Monte Carlo Tests of the Accuracy of Cluster Analysis Algorithms: A Comparison of Hierarchical and Nonhierarchical MethodsMultivariate Behavioral Research, 1985
- Analysis of Categorical DataPublished by University of Toronto Press Inc. (UTPress) ,1980
- An Examination of the Effect of Six Types of Error Perturbation on Fifteen Clustering AlgorithmsPsychometrika, 1980
- A Proposal for Handling Missing DataPsychometrika, 1975
- The Analysis of Incomplete DataPublished by JSTOR ,1971
- On Grouping for Maximum HomogeneityJournal of the American Statistical Association, 1958