Data clustering and noise undressing for correlation matrices
Preprint
- 16 January 2001
Abstract
We discuss a new approach to data clustering. We find that maximum likelihood leads naturally to an Hamiltonian of Potts variables which depends on the correlation matrix and whose low temperature behavior describes the correlation structure of the data. For random, uncorrelated data sets no correlation structure emerges. On the other hand for data sets with a built-in cluster structure, the method is able to detect and recover efficiently that structure. Finally we apply the method to financial time series, where the low temperature behavior reveals a non trivial clustering.Keywords
All Related Versions
- Version 1, 2001-01-16, ArXiv
- Published version: Physical Review E, 63 (6), 061101.
This publication has 0 references indexed in Scilit: