Utilizing logical relationships in genomic data to decipher cellular processes
- 7 October 2005
- journal article
- review article
- Published by Wiley in The FEBS Journal
- Vol. 272 (20) , 5110-5118
- https://doi.org/10.1111/j.1742-4658.2005.04946.x
Abstract
The wealth of available genomic data has spawned a corresponding interest in computational methods that can impart biological meaning and context to these experiments. Traditional computational methods have drawn relationships between pairs of proteins or genes based on notions of equality or similarity between their patterns of occurrence or behavior. For example, two genes displaying similar variation in expression, over a number of experiments, may be predicted to be functionally related. We have introduced a natural extension of these approaches, instead identifying logical relationships involving triplets of proteins. Triplets provide for various discrete kinds of logic relationships, leading to detailed inferences about biological associations. For instance, a protein C might be encoded within an organism if, and only if, two other proteins A and B are also both encoded within the organism, thus suggesting that gene C is functionally related to genes A and B. The method has been applied fruitfully to both phylogenetic and microarray expression data, and has been used to associate logical combinations of protein activity with disease state phenotypes, revealing previously unknown ternary relationships among proteins, and illustrating the inherent complexities that arise in biological data.Keywords
This publication has 52 references indexed in Scilit:
- Systematic Association of Genes to Phenotypes by Genome and Literature MiningPLoS Biology, 2005
- Differential Expression of Glucose Transporters in Normal and Pathologic Thyroid TissueThyroid®, 2004
- A Map of the Interactome Network of the Metazoan C. elegansScience, 2004
- A Protein Interaction Map of Drosophila melanogasterScience, 2003
- Transitive functional annotation by shortest-path analysis of gene expression dataProceedings of the National Academy of Sciences, 2002
- Gene expression profiling predicts clinical outcome of breast cancerNature, 2002
- Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometryNature, 2002
- Functional organization of the yeast proteome by systematic analysis of protein complexesNature, 2002
- A comprehensive two-hybrid analysis to explore the yeast protein interactomeProceedings of the National Academy of Sciences, 2001
- Using Bayesian Networks to Analyze Expression DataJournal of Computational Biology, 2000