Gene Clustering Based on Clusterwide Mutual Information
- 1 January 2004
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 11 (1) , 147-161
- https://doi.org/10.1089/106652704773416939
Abstract
Cluster analysis of gene-wide expression data from DNA microarray hybridization studies has proved to be a useful tool for identifying biologically relevant groupings of genes and constructing gene regulatory networks. The motivation for considering mutual information is its capacity to measure a general dependence among gene random variables. We propose a novel clustering strategy based on minimizing mutual information among gene clusters. Simulated annealing is employed to solve the optimization problem. Bootstrap techniques are employed to get more accurate estimates of mutual information when the data sample size is small. Moreover, we propose to combine the mutual information criterion and traditional distance criteria such as the Euclidean distance and the fuzzy membership metric in designing the clustering algorithm. The performances of the new clustering methods are compared with those of some existing methods, using both synthesized data and experimental data. It is seen that the clustering algorithm based on a combined metric of mutual information and fuzzy membership achieves the best performance. The supplemental material is available at www.gspsnap.tamu.edu/gspweb/zxb/glioma_zxb.Keywords
This publication has 28 references indexed in Scilit:
- Inference from Clustering with Application to Gene-Expression MicroarraysJournal of Computational Biology, 2002
- Molecular classification of cutaneous malignant melanoma by gene expression profilingNature, 2000
- Inferring qualitative relations in genetic networks and metabolic pathwaysBioinformatics, 2000
- Using Bayesian Networks to Analyze Expression DataJournal of Computational Biology, 2000
- Knowledge-based analysis of microarray gene expression data by using support vector machinesProceedings of the National Academy of Sciences, 2000
- Clustering Gene Expression PatternsJournal of Computational Biology, 1999
- Computational methods for theidentification of differential and coordinated gene expressionHuman Molecular Genetics, 1999
- DNA microarrays in drug discovery and developmentNature Genetics, 1999
- Cluster analysis and display of genome-wide expression patternsProceedings of the National Academy of Sciences, 1998
- Ratio-based decisions and the quantitative analysis of cDNA microarray imagesJournal of Biomedical Optics, 1997