Comparative analysis of clustering methods for gene expression time course data
Open Access
- 1 January 2004
- journal article
- Published by FapUNIFESP (SciELO) in Genetics and Molecular Biology
- Vol. 27 (4) , 623-631
- https://doi.org/10.1590/s1415-47572004000400025
Abstract
This work performs a data driven comparative study of clustering methods used in the analysis of gene expression time courses (or time series). Five clustering methods found in the literature of gene expression analysis are compared: agglomerative hierarchical clustering, CLICK, dynamical clustering, k-means and self-organizing maps. In order to evaluate the methods, a k-fold cross-validation procedure adapted to unsupervised methods is applied. The accuracy of the results is assessed by the comparison of the partitions obtained in these experiments with gene annotation, such as protein function and series classification.Keywords
This publication has 15 references indexed in Scilit:
- Comparisons and validation of statistical clustering techniques for microarray gene expression dataBioinformatics, 2003
- Computational analysis of microarray dataNature Reviews Genetics, 2001
- Knowledge-based analysis of microarray gene expression data by using support vector machinesProceedings of the National Academy of Sciences, 2000
- Exploring Expression Data: Identification and Analysis of Coexpressed GenesGenome Research, 1999
- Data clusteringACM Computing Surveys, 1999
- Cluster analysis and display of genome-wide expression patternsProceedings of the National Academy of Sciences, 1998
- A Genome-Wide Transcriptional Analysis of the Mitotic Cell CycleMolecular Cell, 1998
- A comparison of SOM neural network and hierarchical clustering methodsEuropean Journal of Operational Research, 1996
- How many clusters are best? - An experimentPattern Recognition, 1987
- A Study of the Comparability of External Criteria for Hierarchical Cluster AnalysisMultivariate Behavioral Research, 1986