Replicating Cluster Analysis: Method, Consistency, and Validity
- 1 April 1989
- journal article
- research article
- Published by Taylor & Francis in Multivariate Behavioral Research
- Vol. 24 (2) , 147-161
- https://doi.org/10.1207/s15327906mbr2402_1
Abstract
To replicate a cluster analysis, clusters must first be described in terms of an objective classification rule. The effectiveness of three rules (nearest neighbor classification, nearest centroid assignment, and quadratic discriminant analysis) for replicating Ward's algorithm (Ward, 1963) is evaluated by Monte Carlo study. Consistent replication links clusters and their replicas identically over alternative cross-validation sequences (i.e., A replicates B, B replicates A) and is associated with recovery of known clusters. Replication using nearest neighbor classification results in superior goodness-of-fit, more frequent consistent replication, and significant prediction of recovery. Although moderate or greater replication denotes good recovery, replication is not a necessary condition of recovery of true clusters.
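The replication procedure summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions made for illustration: Cohen's kappa stands in for the unspecified goodness-of-fit measure, clusters are linked to their replicas by an exhaustive search over label permutations, and the data are three synthetic, well-separated blobs produced by a hypothetical `make_sample` helper. Only the nearest centroid and nearest neighbor rules are shown; quadratic discriminant analysis is omitted for brevity.

```python
import random
from itertools import permutations

def sq_dist(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))

def centroid(points):
    return tuple(sum(col) / len(points) for col in zip(*points))

def ward(points, k):
    """Naive Ward agglomeration: merge the pair of clusters whose fusion
    least increases the within-cluster sum of squares, until k remain."""
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > k:
        cents = [centroid([points[i] for i in c]) for c in clusters]
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                na, nb = len(clusters[a]), len(clusters[b])
                cost = na * nb / (na + nb) * sq_dist(cents[a], cents[b])
                if best is None or cost < best[0]:
                    best = (cost, a, b)
        _, a, b = best
        clusters[a] += clusters.pop(b)
    labels = [0] * len(points)
    for lab, members in enumerate(clusters):
        for i in members:
            labels[i] = lab
    return labels

def nearest_centroid_rule(points, labels):
    cents = {lab: centroid([p for p, l in zip(points, labels) if l == lab])
             for lab in set(labels)}
    return lambda p: min(cents, key=lambda lab: sq_dist(p, cents[lab]))

def nearest_neighbor_rule(points, labels):
    return lambda p: labels[min(range(len(points)),
                                key=lambda i: sq_dist(p, points[i]))]

def kappa(x, y):
    """Cohen's kappa between two label sequences (assumed fit measure)."""
    n = len(x)
    p_obs = sum(a == b for a, b in zip(x, y)) / n
    p_exp = sum(x.count(c) * y.count(c) for c in set(x) | set(y)) / n ** 2
    return (p_obs - p_exp) / (1 - p_exp)

def link(train, train_labels, test, test_labels, make_rule, k):
    """Classify the test sample with a rule built on the training clusters,
    then return the best kappa and the train-to-test label mapping."""
    rule = make_rule(train, train_labels)
    replica = [rule(p) for p in test]
    return max((kappa([perm[r] for r in replica], test_labels), perm)
               for perm in permutations(range(k)))

def consistent(perm_ab, perm_ba):
    """True when both cross-validation sequences link clusters identically."""
    return all(perm_ba[perm_ab[a]] == a for a in range(len(perm_ab)))

def make_sample(n_per_blob, rng):
    # Hypothetical synthetic data: three well-separated 2-D Gaussian blobs.
    centers = [(0, 0), (20, 0), (0, 20)]
    return [(rng.gauss(cx, 1), rng.gauss(cy, 1))
            for cx, cy in centers for _ in range(n_per_blob)]

rng = random.Random(0)
sample_a, sample_b = make_sample(10, rng), make_sample(10, rng)
labels_a, labels_b = ward(sample_a, 3), ward(sample_b, 3)
for make_rule in (nearest_centroid_rule, nearest_neighbor_rule):
    k_ab, perm_ab = link(sample_a, labels_a, sample_b, labels_b, make_rule, 3)
    k_ba, perm_ba = link(sample_b, labels_b, sample_a, labels_a, make_rule, 3)
    print(make_rule.__name__, round(k_ab, 2), round(k_ba, 2),
          consistent(perm_ab, perm_ba))
```

Each call to `link` covers one cross-validation sequence: a rule built on one sample's Ward clusters classifies the other sample, and the result is compared with that sample's own Ward solution. Replication is consistent when the two label mappings are inverses of each other. The permutation search is only practical for a small number of clusters.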