Accelerating exactk-means algorithms with geometric reasoning

Abstract
We present new algorithms for the k-means clustering problem.They use the kd-tree data structure to reduce the large number ofnearest-neighbor queries issued by the traditional algorithm. Sufficientstatistics are stored in the nodes of the kd-tree. Then, an analysis ofthe geometry of the current cluster centers results in great reductionof the work needed to update the centers. Our algorithms behaveexactly as the traditional k-means algorithm. Proofs of correctnessare included. The...

This publication has 4 references indexed in Scilit: