Resampling Method For Unsupervised Estimation Of Cluster Validity
Preprint
- 18 May 2000
Abstract
We introduce a method for validation of results obtained by clustering analysis of data. The method is based on resampling the available data. A figure of merit that measures the stability of clustering solutions against resampling is introduced. Clusters which are stable against resampling give rise to local maxima of this figure of merit. This is presented first for a one-dimensional data set, for which an analytic approximation for the figure of merit is derived and compared with numerical measurements. Next, the applicability of the method is demonstrated for higher dimensional data, including gene microarray expression data.Keywords
All Related Versions
This publication has 0 references indexed in Scilit: