Integrative analysis of genome-scale data by using pseudoinverse projection predicts novel correlation between DNA replication and RNA transcription
Open Access
- 15 November 2004
- journal article
- Published in Proceedings of the National Academy of Sciences
- Vol. 101 (47) , 16577-16582
- https://doi.org/10.1073/pnas.0406767101
Abstract
We describe an integrative data-driven mathematical framework that formulates any number of genome-scale molecular biological data sets in terms of one chosen set of data samples, or of profiles extracted mathematically from data samples, designated the “basis” set. By using pseudoinverse projection, the molecular biological profiles of the data samples are least-squares-approximated as superpositions of the basis profiles. Reconstruction of the data in the basis simulates experimental observation of only the cellular states manifest in the data that correspond to those of the basis. Classification of the data samples according to their reconstruction in the basis, rather than their overall measured profiles, maps the cellular states of the data onto those of the basis and gives a global picture of the correlations and possibly also causal coordination of these two sets of states. We illustrate this framework with an integration of yeast genome-scale proteins' DNA-binding data with cell cycle mRNA expression time course data. Novel correlation between DNA replication initiation and RNA transcription during the yeast cell cycle, which might be due to a previously unknown mechanism of regulation, is predicted.
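The pseudoinverse projection described in the abstract can be sketched numerically. The following is a minimal illustration on synthetic random data, not the authors' implementation: the matrix names `B` (basis profiles as columns), `D` (data samples as columns), `C` (coefficients), and the matrix sizes are all assumptions made for this example, and the final labeling step (largest absolute coefficient) is a simplified stand-in for the classification the abstract mentions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: genes x basis profiles, genes x data samples.
n_genes, n_basis, n_samples = 200, 4, 6

# B: synthetic basis matrix, one column per basis profile (assumed layout).
B = rng.standard_normal((n_genes, n_basis))

# D: synthetic data matrix built as noisy superpositions of the basis columns.
true_coeffs = rng.standard_normal((n_basis, n_samples))
D = B @ true_coeffs + 0.1 * rng.standard_normal((n_genes, n_samples))

# Pseudoinverse projection: C minimizes ||D - B C|| in the least-squares sense.
C = np.linalg.pinv(B) @ D

# Reconstruction of the data samples as superpositions of the basis profiles.
D_hat = B @ C

# Simplified labeling (an assumption of this sketch): assign each data sample
# to the basis profile with the largest absolute coefficient.
labels = np.argmax(np.abs(C), axis=0)
```

The residual `D - D_hat` is orthogonal to every basis profile, which is the defining property of a least-squares fit: whatever part of a data sample cannot be expressed in the basis is left out of the reconstruction.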