Clustering by Passing Messages Between Data Points
Top Cited Papers
- 16 February 2007
- journal article
- other
- Published by American Association for the Advancement of Science (AAAS) in Science
- Vol. 315 (5814) , 972-976
- https://doi.org/10.1126/science.1136800
Abstract
Clustering data by identifying a subset of representative examples is important for processing sensory signals and detecting patterns in data. Such “exemplars” can be found by randomly choosing an initial subset of data points and then iteratively refining it, but this works well only if that initial choice is close to a good solution. We devised a method called “affinity propagation,” which takes as input measures of similarity between pairs of data points. Real-valued messages are exchanged between data points until a high-quality set of exemplars and corresponding clusters gradually emerges. We used affinity propagation to cluster images of faces, detect genes in microarray data, identify representative sentences in this manuscript, and identify cities that are efficiently accessed by airline travel. Affinity propagation found clusters with much lower error than other methods, and it did so in less than one-hundredth the amount of time.Keywords
This publication has 12 references indexed in Scilit:
- Genome-wide analysis of mouse transcripts using exon microarrays and factor graphsNature Genetics, 2005
- A Split-Merge Markov chain Monte Carlo Procedure for the Dirichlet Process Mixture ModelJournal of Computational and Graphical Statistics, 2004
- Passing Messages Between DisciplinesScience, 2003
- Analytic and Algorithmic Solution of Random Satisfiability ProblemsScience, 2002
- A Constant-Factor Approximation Algorithm for the k-Median ProblemJournal of Computer and System Sciences, 2002
- Factor graphs and the sum-product algorithmIEEE Transactions on Information Theory, 2001
- Normalized cuts and image segmentationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2000
- Good error-correcting codes based on very sparse matricesIEEE Transactions on Information Theory, 1999
- Near optimum error correcting coding and decoding: turbo-codesIEEE Transactions on Communications, 1996
- Neural networks and physical systems with emergent collective computational abilities.Proceedings of the National Academy of Sciences, 1982