Determining the Number of Clusters by Sampling With Replacement.
- 1 January 2004
- journal article
- Published by American Psychological Association (APA) in Psychological Methods
- Vol. 9 (2) , 238-249
- https://doi.org/10.1037/1082-989x.9.2.238
Abstract
A split-sample replication criterion originally proposed by J. E. Overall and K. N. Magee (1992) as a stopping rule for hierarchical cluster analysis is applied to multiple data sets generated by sampling with replacement from an original simulated primary data set. An investigation of the validity of this bootstrap procedure was undertaken using different combinations of the true number of latent populations, degrees of overlap, and sample sizes. The bootstrap procedure enhanced the accuracy of identifying the true number of latent populations under virtually all conditions. Increasing the size of the resampled data sets relative to the size of the primary data set further increased accuracy. A computer program to implement the bootstrap stopping rule is made available via a referenced Web site.Keywords
This publication has 15 references indexed in Scilit:
- A Cautionary Note on using Internal Cross Validation to Select the Number of ClustersPsychometrika, 1999
- Comparative Evaluation of Two Superior Stopping Rules for Hierarchical Cluster AnalysisPsychometrika, 1994
- Replication as a Rule for Determining the Number of Clusters in Hierarchial Cluster AnalysisApplied Psychological Measurement, 1992
- Comparing partitionsJournal of Classification, 1985
- An Examination of Procedures for Determining the Number of Clusters in a Data SetPsychometrika, 1985
- Multivariate analyses of the MMPI profiles of low back pain patientsJournal of Behavioral Medicine, 1978
- A dendrite method for cluster analysisCommunications in Statistics - Theory and Methods, 1974
- Classification of Depressed Patients: A Cluster Analysis Derived GroupingThe British Journal of Psychiatry, 1971
- Nosology of depression and differential response to drugsPublished by American Medical Association (AMA) ,1966
- Hierarchical Grouping to Optimize an Objective FunctionJournal of the American Statistical Association, 1963