Similarities and Differences in Genome-Wide Expression Data of Six Organisms
Open Access
- 15 December 2003
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Biology
- Vol. 2 (1) , e9
- https://doi.org/10.1371/journal.pbio.0020009
Abstract
Comparing genomic properties of different organisms is of fundamental importance in the study of biological and evolutionary principles. Although differences among organisms are often attributed to differential gene expression, genome-wide comparative analysis thus far has been based primarily on genomic sequence information. We present a comparative study of large datasets of expression profiles from six evolutionarily distant organisms: S. cerevisiae, C. elegans, E. coli, A. thaliana, D. melanogaster, and H. sapiens. We use genomic sequence information to connect these data and compare global and modular properties of the transcription programs. Linking genes whose expression profiles are similar, we find that for all organisms the connectivity distribution follows a power-law, highly connected genes tend to be essential and conserved, and the expression program is highly modular. We reveal the modular structure by decomposing each set of expression data into coexpressed modules. Functionally related sets of genes are frequently coexpressed in multiple organisms. Yet their relative importance to the transcription program and their regulatory relationships vary among organisms. Our results demonstrate the potential of combining sequence and expression data for improving functional gene annotation and expanding our understanding of how gene expression and diversity evolved.Keywords
This publication has 39 references indexed in Scilit:
- Conserved pathways within bacteria and yeast as revealed by global protein network alignmentProceedings of the National Academy of Sciences, 2003
- Iterative signature algorithm for the analysis of large-scale gene expression dataPhysical Review E, 2003
- Transcriptional Regulatory Networks in Saccharomyces cerevisiaeScience, 2002
- Hierarchical Organization of Modularity in Metabolic NetworksScience, 2002
- Functional profiling of the Saccharomyces cerevisiae genomeNature, 2002
- Network motifs in the transcriptional regulation network of Escherichia coliNature Genetics, 2002
- Statistical mechanics of complex networksReviews of Modern Physics, 2002
- Identification of Potential Interaction Networks Using Sequence-Based Searches for Conserved Protein-Protein Interactions or “Interologs”Genome Research, 2001
- Highly Optimized Tolerance: Robustness and Design in Complex SystemsPhysical Review Letters, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997