CLICK and EXPANDER: a system for clustering and visualizing gene expression data
Open Access
- 22 September 2003
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 19 (14), 1787-1799
- https://doi.org/10.1093/bioinformatics/btg232
Abstract
Motivation: Microarrays have become a central tool in biological research. Their applications range from functional annotation to tissue classification and genetic network inference. A key step in the analysis of gene expression data is the identification of groups of genes that manifest similar expression patterns. This translates to the algorithmic problem of clustering genes based on their expression patterns.
Results: We present a novel clustering algorithm, called CLICK, and its applications to gene expression analysis. The algorithm utilizes graph-theoretic and statistical techniques to identify tight groups (kernels) of highly similar elements, which are likely to belong to the same true cluster. Several heuristic procedures are then used to expand the kernels into the full clusters. We report on the application of CLICK to a variety of gene expression data sets. In all those applications it outperformed extant algorithms according to several common figures of merit. We also point out that CLICK can be successfully used for the identification of common regulatory motifs in the upstream regions of co-regulated genes. Furthermore, we demonstrate how CLICK can be used to accurately classify tissue samples into disease types, based on their expression profiles. Finally, we present a new Java-based graphical tool, called EXPANDER, for gene expression analysis and visualization, which incorporates CLICK and several other popular clustering algorithms.
Availability: http://www.cs.tau.ac.il/~rshamir/expander/expander.html