Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes
- 21 August 2003
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 100 (18), 10146-10151
- https://doi.org/10.1073/pnas.1732547100
Abstract
We present a general algorithm to detect genes differentially expressed between two nonhomogeneous time-series data sets. As increasing amounts of high-throughput biological data become available, a major challenge in genomic and computational biology is to develop methods for comparing data from different experimental sources. Time-series whole-genome expression data are a particularly valuable source of information because they can describe an unfolding biological process such as the cell cycle or immune response. However, comparisons of time-series expression data sets are hindered by biological and experimental inconsistencies such as differences in sampling rate, variations in the timing of biological processes, and the lack of repeats. Our algorithm overcomes these difficulties by using a continuous representation for time-series data and combining a noise model for individual samples with a global difference measure. We introduce a corresponding statistical method for computing the significance of this differential expression measure. We used our algorithm to compare cell-cycle-dependent gene expression in wild-type and knockout yeast strains. Our algorithm identified a set of 56 differentially expressed genes, and these results were validated by using independent protein-DNA-binding data. Unlike previous methods, our algorithm was also able to identify 22 non-cell-cycle-regulated genes as differentially expressed. This set of genes is significantly correlated in a set of independent expression experiments, suggesting additional roles for the transcription factors Fkh1 and Fkh2 in controlling cellular activity in yeast.
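The general idea in the abstract, fitting a continuous curve to each time series and then measuring how far apart the two curves are over their shared time range, can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the curve model, the noise model, or the significance test, so the sketch substitutes a least-squares polynomial for the continuous representation and a mean squared difference for the global measure, and it omits the noise model and significance computation entirely. The function names, the polynomial degree, and the sample data are all hypothetical.

```python
import numpy as np


def fit_curve(times, values, degree=3):
    """Fit a least-squares polynomial to one expression profile.

    Assumption: a polynomial stands in for whatever continuous
    representation the paper uses; the abstract does not say.
    """
    return np.polynomial.Polynomial.fit(times, values, degree)


def global_difference(times_a, values_a, times_b, values_b,
                      degree=3, grid_size=200):
    """Mean squared difference between two fitted curves.

    The curves are compared only on the time interval covered by both
    experiments, so the two series may use different sampling rates.
    """
    times_a = np.asarray(times_a, dtype=float)
    times_b = np.asarray(times_b, dtype=float)
    curve_a = fit_curve(times_a, np.asarray(values_a, dtype=float), degree)
    curve_b = fit_curve(times_b, np.asarray(values_b, dtype=float), degree)

    # Restrict the comparison to the overlapping time range.
    start = max(times_a.min(), times_b.min())
    end = min(times_a.max(), times_b.max())
    grid = np.linspace(start, end, grid_size)

    # Integrate the squared difference with the trapezoid rule,
    # then normalize by the length of the interval.
    squared = (curve_a(grid) - curve_b(grid)) ** 2
    area = np.sum((squared[1:] + squared[:-1]) / 2.0 * np.diff(grid))
    return area / (end - start)


def example_profile(times):
    """Hypothetical expression profile used only for illustration."""
    return (np.asarray(times, dtype=float) / 60.0 - 1.0) ** 3


# Two hypothetical experiments sampled at different rates (minutes).
wild_type_times = np.arange(0, 121, 10)
knockout_times = np.arange(0, 121, 7)

same = global_difference(wild_type_times, example_profile(wild_type_times),
                         knockout_times, example_profile(knockout_times))
shifted = global_difference(wild_type_times, example_profile(wild_type_times),
                            knockout_times, example_profile(knockout_times) + 1.0)
```

Here `same` compares one underlying profile sampled at two rates, while `shifted` adds a constant offset to the second series, so a gene-level score of this kind is expected to be much larger for `shifted`. Deciding which scores are significant would require the noise model and statistical test that the abstract mentions but this sketch does not attempt.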