Iterative signature algorithm for the analysis of large-scale gene expression data
Top Cited Papers
- 11 March 2003
- journal article
- research article
- Published by American Physical Society (APS) in Physical Review E
- Vol. 67 (3) , 031902
- https://doi.org/10.1103/physreve.67.031902
Abstract
We present an approach for the analysis of genome-wide expression data. Our method is designed to overcome the limitations of traditional techniques, when applied to large-scale data. Rather than alloting each gene to a single cluster, we assign both genes and conditions to context-dependent and potentially overlapping transcription modules. We provide a rigorous definition of a transcription module as the object to be retrieved from the expression data. An efficient algorithm, which searches for the modules encoded in the data by iteratively refining sets of genes and conditions until they match this definition, is established. Each iteration involves a linear map, induced by the normalized expression matrix, followed by the application of a threshold function. We argue that our method is in fact a generalization of singular value decomposition, which corresponds to the special case where no threshold is applied. We show analytically that for noisy expression data our approach leads to better classification due to the implementation of the threshold. This result is confirmed by numerical analyses based on in silico expression data. We discuss briefly results obtained by applying our algorithm to expression data from the yeast Saccharomyces cerevisiae.Keywords
All Related Versions
This publication has 23 references indexed in Scilit:
- Navigating gene expression using microarrays — a technology reviewNature Cell Biology, 2001
- The Stanford Microarray DatabaseNucleic Acids Research, 2001
- Distinctive gene expression patterns in human mammary epithelial cells and breast cancersProceedings of the National Academy of Sciences, 1999
- Systematic determination of genetic network architectureNature Genetics, 1999
- Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arraysProceedings of the National Academy of Sciences, 1999
- Array of hopeNature Genetics, 1999
- Cluster analysis and display of genome-wide expression patternsProceedings of the National Academy of Sciences, 1998
- Comprehensive Identification of Cell Cycle–regulated Genes of the YeastSaccharomyces cerevisiaeby Microarray HybridizationMolecular Biology of the Cell, 1998
- Exploring the Metabolic and Genetic Control of Gene Expression on a Genomic ScaleScience, 1997
- Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA MicroarrayScience, 1995