Using Bayesian networks to analyze expression data
- 8 April 2000
- proceedings article
- Published by Association for Computing Machinery (ACM)
Abstract
DNA hybridization arrays simultaneously measure the expression level for thousands of genes. These measurements provide a “snapshot” of transcription levels within the cell. A major challenge in computational biology is to uncover, from such measurements, gene/protein interactions and key biological features of cellular systems. In this paper, we propose a new framework for discovering interactions between genes based on multiple expression measurements This framework builds on the use of Bayesian networks for representing statistical dependencies. A Bayesian network is a graph-based model of joint multi-variate probability distributions that captures properties of conditional independence between variables. Such models are attractive for their ability to describe complex stochastic processes, and for providing clear methodologies for learning from (noisy) observations. We start by showing how Bayesian networks can describe interactions between genes. We then present an efficient algorithm capable of learning such networks and statistical method to assess our confidence in their features. Finally, we apply this method to the S. cerevisiae cell-cycle measurements of Spellman et al. [35] to uncover biological featuresKeywords
This publication has 17 references indexed in Scilit:
- Clustering Gene Expression PatternsJournal of Computational Biology, 1999
- Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arraysProceedings of the National Academy of Sciences, 1999
- The Transcriptional Program in the Response of Human Fibroblasts to SerumScience, 1999
- Effect of DNA lesions on transcription elongationBiochimie, 1999
- Cluster analysis and display of genome-wide expression patternsProceedings of the National Academy of Sciences, 1998
- Comprehensive Identification of Cell Cycle–regulated Genes of the YeastSaccharomyces cerevisiaeby Microarray HybridizationMolecular Biology of the Cell, 1998
- Pfam: multiple sequence alignments and HMM-profiles of protein domainsNucleic Acids Research, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Expression monitoring by hybridization to high-density oligonucleotide arraysNature Biotechnology, 1996
- An impaired RNA polymerase II activity in Saccharomyces cerevisiae causes cell-cycle inhibition at STARTMolecular Genetics and Genomics, 1993