Clustering Based on Conditional Distributions in an Auxiliary Space
- 1 January 2002
- journal article
- Published by MIT Press in Neural Computation
- Vol. 14 (1), 217-239
- https://doi.org/10.1162/089976602753284509
Abstract
We study the problem of learning groups or categories that are local in the continuous primary space but homogeneous by the distributions of an associated auxiliary random variable over a discrete auxiliary space. Assuming that variation in the auxiliary space is meaningful, categories will emphasize similarly meaningful aspects of the primary space. From a data set consisting of pairs of primary and auxiliary items, the categories are learned by minimizing a Kullback-Leibler divergence-based distortion between (implicitly estimated) distributions of the auxiliary data, conditioned on the primary data. Still, the categories are defined in terms of the primary space. An online algorithm resembling the traditional Hebb-type competitive learning is introduced for learning the categories. Minimizing the distortion criterion turns out to be equivalent to maximizing the mutual information between the categories and the auxiliary data. In addition, connections to density estimation and to the distributional clustering paradigm are outlined. The method is demonstrated by clustering yeast gene expression data from DNA chips, with biological knowledge about the functional classes of the genes as the auxiliary data.
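The equivalence stated in the abstract, between minimizing the Kullback-Leibler distortion and maximizing the mutual information between categories and auxiliary data, can be checked numerically on a toy problem. The sketch below is not the paper's algorithm. It assumes a finite set of primary points with known conditionals p(c|x), hard cluster assignments, and cluster prototypes set to the within-cluster average of p(c|x). All names (`kl_distortion`, `mutual_information`, `assign`, and so on) are hypothetical. Under these assumptions the distortion equals I(C;X) - I(C;V), where V is the cluster index, so lowering the distortion raises I(C;V) because I(C;X) does not depend on the clustering.

```python
import numpy as np

EPS = 1e-12  # guards log(0); the toy conditionals below are strictly positive


def kl_distortion(p_x, p_c_given_x, assign, n_clusters):
    """Average KL divergence from each point's p(c|x) to its cluster prototype.

    Assumption: the prototype of a cluster is the p(x)-weighted mean of the
    conditionals of the points assigned to it.
    """
    total = 0.0
    for j in range(n_clusters):
        mask = assign == j
        if not mask.any():
            continue  # empty clusters contribute nothing
        w = p_x[mask]
        cond = p_c_given_x[mask]
        proto = (w[:, None] * cond).sum(axis=0) / w.sum()
        kl = (cond * (np.log(cond + EPS) - np.log(proto + EPS))).sum(axis=1)
        total += float((w * kl).sum())
    return total


def mutual_information(joint):
    """Mutual information (in nats) of a 2-D joint probability table."""
    pa = joint.sum(axis=1, keepdims=True)
    pb = joint.sum(axis=0, keepdims=True)
    return float((joint * (np.log(joint + EPS) - np.log(pa * pb + EPS))).sum())


# Toy data (hypothetical): 6 primary points, 4 auxiliary classes, 3 clusters.
rng = np.random.default_rng(0)
n_points, n_classes, n_clusters = 6, 4, 3
p_x = np.full(n_points, 1.0 / n_points)
p_c_given_x = rng.dirichlet(np.ones(n_classes), size=n_points)
assign = np.array([0, 0, 1, 1, 2, 2])

# Joint tables p(x, c) and p(v, c), where v is the cluster index.
joint_xc = p_x[:, None] * p_c_given_x
joint_vc = np.zeros((n_clusters, n_classes))
for j in range(n_clusters):
    joint_vc[j] = joint_xc[assign == j].sum(axis=0)

distortion = kl_distortion(p_x, p_c_given_x, assign, n_clusters)
i_xc = mutual_information(joint_xc)
i_vc = mutual_information(joint_vc)

print(f"distortion        = {distortion:.6f}")
print(f"I(C;X) - I(C;V)   = {i_xc - i_vc:.6f}")
```

The two printed quantities should agree up to floating-point error for any hard assignment, which is the discrete analogue of the equivalence the abstract describes. The method in the paper works with a continuous primary space and implicitly estimated conditionals, which this sketch does not attempt to reproduce.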