Coupled two-way clustering analysis of gene microarray data
- 17 October 2000
- journal article
- Published in Proceedings of the National Academy of Sciences
- Vol. 97 (22), 12079-12084
- https://doi.org/10.1073/pnas.210134797
Abstract
We present a coupled two-way clustering approach to gene microarray data analysis. The main idea is to identify subsets of the genes and samples, such that when one of these is used to cluster the other, stable and significant partitions emerge. The search for such subsets is a computationally complex task. We present an algorithm, based on iterative clustering, that performs such a search. This analysis is especially suitable for gene microarray data, where the contributions of a variety of biological mechanisms to the gene expression levels are entangled in a large body of experimental data. The method was applied to two gene microarray data sets, on colon cancer and leukemia. By identifying relevant subsets of the data and focusing on them we were able to discover partitions and correlations that were masked and hidden when the full dataset was used in the analysis. Some of these partitions have clear biological interpretation; others can serve to identify possible directions for future research.
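The iterative search described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify the clustering method or the stability criterion, so the sketch assumes a simple stand-in: connected components of a distance-threshold graph, with a cluster counted as "stable" when it is a proper subset of at least `min_size` members. The names `rms_distance`, `threshold_clusters`, `ctwc`, and the 6 x 6 matrix `data` are hypothetical, and the matrix is made-up toy data rather than anything from the paper.

```python
from itertools import product
from math import sqrt


def rms_distance(u, v):
    """Root-mean-square difference, comparable across feature subsets of different sizes."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)) / len(u))


def threshold_clusters(objects, features, value, threshold, min_size=2):
    """Stand-in clusterer (an assumption, not the paper's method): connected
    components of the graph linking objects closer than `threshold`."""
    objects, features = sorted(objects), sorted(features)
    vec = {o: [value(o, f) for f in features] for o in objects}
    unseen, clusters = set(objects), []
    while unseen:
        seed = unseen.pop()
        component, stack = {seed}, [seed]
        while stack:
            cur = stack.pop()
            near = {o for o in unseen if rms_distance(vec[cur], vec[o]) < threshold}
            unseen -= near
            component |= near
            stack.extend(near)
        # Assumed "stability" rule: keep proper sub-clusters that are large enough.
        if min_size <= len(component) < len(objects):
            clusters.append(frozenset(component))
    return clusters


def ctwc(data, threshold, min_size=2, max_rounds=10):
    """Coupled two-way search over data[gene][sample]: every known gene subset is
    used to cluster every known sample subset and vice versa, until nothing new appears."""
    gene_sets = [frozenset(range(len(data)))]
    sample_sets = [frozenset(range(len(data[0])))]
    done = set()
    for _ in range(max_rounds):
        todo = [p for p in product(gene_sets, sample_sets) if p not in done]
        if not todo:
            break
        for genes, samples in todo:
            done.add((genes, samples))
            found_genes = threshold_clusters(
                genes, samples, lambda g, s: data[g][s], threshold, min_size)
            found_samples = threshold_clusters(
                samples, genes, lambda s, g: data[g][s], threshold, min_size)
            gene_sets += [c for c in found_genes if c not in gene_sets]
            sample_sets += [c for c in found_samples if c not in sample_sets]
    return gene_sets, sample_sets


# Toy matrix (made up): genes 0-2 split samples {0,1,2} from {3,4,5};
# genes 3-5 split samples {0,2,4} from {1,3,5}.
data = [
    [1, 1, 1, 5, 5, 5],
    [1, 2, 1, 5, 6, 5],
    [2, 1, 1, 6, 5, 5],
    [5, 1, 5, 1, 5, 1],
    [6, 1, 5, 1, 5, 2],
    [5, 2, 5, 1, 6, 1],
]
gene_sets, sample_sets = ctwc(data, threshold=2.0)
# Sample clusters obtained when all genes are used at once, for comparison.
full_only = threshold_clusters(range(6), range(6), lambda s, g: data[g][s], 2.0)
```

In this toy case, clustering the samples with all six genes yields only the small groups {0, 2} and {3, 5}, because the two gene groups pull the samples in different directions. Once the gene subset {0, 1, 2} has been found and is used on its own, the sample partition {0, 1, 2} versus {3, 4, 5} appears in `sample_sets`, which mirrors the abstract's point that focusing on relevant subsets can reveal structure masked in the full data.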