Some new indexes of cluster validity
- 1 June 1998
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)
- Vol. 28 (3), 301-315
- https://doi.org/10.1109/3477.678624
Abstract
We review two clustering algorithms (hard c-means and single linkage) and three indexes of crisp cluster validity (Hubert's statistics, the Davies-Bouldin index, and Dunn's index). We illustrate two deficiencies of Dunn's index which make it overly sensitive to noisy clusters and propose several generalizations of it that are not as brittle to outliers in the clusters. Our numerical examples show that the standard measure of interset distance (the minimum distance between points in a pair of sets) is the worst (least reliable) measure upon which to base cluster validation indexes when the clusters are expected to form volumetric clouds. Experimental results also suggest that intercluster separation plays a more important role in cluster validation than cluster diameter. Our simulations show that while Dunn's original index has operational flaws, the concept it embodies provides a rich paradigm for validation of partitions that have cloud-like clusters. Five of our generalized Dunn's indexes provide the best validation results for the simulations presented.
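To make the abstract's point about interset distance concrete, here is a minimal Python sketch of Dunn's index in its commonly stated form: the smallest distance between any two clusters divided by the largest cluster diameter. The function names, the toy data, and the averaged set distance used for comparison are assumptions made for illustration; they are not taken from the paper and should not be read as its exact generalized indexes.

```python
from itertools import combinations
from math import dist


def min_set_distance(a, b):
    """Standard interset distance: closest pair of points across two clusters."""
    return min(dist(p, q) for p in a for q in b)


def mean_set_distance(a, b):
    """Illustrative alternative (an assumption): average over all cross-cluster pairs."""
    return sum(dist(p, q) for p in a for q in b) / (len(a) * len(b))


def diameter(cluster):
    """Largest distance between two points of the same cluster (0 for a singleton)."""
    return max((dist(p, q) for p, q in combinations(cluster, 2)), default=0.0)


def dunn_index(clusters, set_distance=min_set_distance):
    """Smallest between-cluster distance divided by the largest cluster diameter."""
    separation = min(set_distance(a, b) for a, b in combinations(clusters, 2))
    largest_diameter = max(diameter(c) for c in clusters)
    return separation / largest_diameter


# Hypothetical toy data: two compact squares, then the same partition
# with one outlier assigned to the first cluster.
clean = [
    [(0, 0), (0, 1), (1, 0), (1, 1)],
    [(5, 0), (5, 1), (6, 0), (6, 1)],
]
noisy = [clean[0] + [(3, 0)], clean[1]]

print(dunn_index(clean))
print(dunn_index(noisy))
print(dunn_index(noisy, set_distance=mean_set_distance))
```

In this toy case a single outlier both shrinks the minimum interset distance and inflates the diameter, so the index drops sharply. Swapping in the averaged set distance softens the effect on the numerator only, which is the kind of substitution the abstract describes in general terms.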
This publication has 9 references indexed in Scilit:
- Multipactor breakdown in waveguide irises. Published by Institute of Electrical and Electronics Engineers (IEEE), 2009
- Sequencing-by-hybridization revisited: the analog-spectrum proposal. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2004
- A geometric approach to cluster validity for normal mixtures. Soft Computing, 1997
- Cluster validation using graph theoretic concepts. Pattern Recognition, 1997
- A possibilistic approach to clustering. IEEE Transactions on Fuzzy Systems, 1993
- Comparing partitions. Journal of Classification, 1985
- Pattern Recognition with Fuzzy Objective Function Algorithms. Published by Springer Nature, 1981
- A Cluster Separation Measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1979
- A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters. Journal of Cybernetics, 1973