Meta-clustering of gene expression data and literature-based information
- 1 December 2003
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGKDD Explorations Newsletter
- Vol. 5 (2), 101-112
- https://doi.org/10.1145/980972.980985
Abstract
The current tendency in the life sciences to spawn ever growing amounts of high-throughput assays has led to a situation where the interpretation of data and the formulation of hypotheses lag the pace at which information is produced. Although the first generation of statistical algorithms scrutinizing single, large-scale data sets found their way into the biological community, the great challenge to connect their results to existing knowledge still remains. Despite the fairly large number of biological databases that is currently available, a lot of relevant information is found in free-text format (such as textual annotations, scientific abstracts and full publications). In this paper we explore how an integrated analysis of expression data and literature-extracted information can reveal biologically meaningful clusters not identified when using microarray information alone. The joint analysis is validated in terms of transcriptional regulation.