Predicting gene targets of perturbations via network-based filtering of mRNA expression compendia
- 8 September 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (21), 2482-2490
- https://doi.org/10.1093/bioinformatics/btn476
Abstract
Motivation: DNA microarrays are routinely applied to study diseased or drug-treated cell populations. A critical challenge is distinguishing the genes directly affected by these perturbations from the hundreds of genes that are indirectly affected. Here, we developed a sparse simultaneous equation model (SSEM) of mRNA expression data and applied Lasso regression to estimate the model parameters, thus constructing a network model of gene interaction effects. This inferred network model was then used to filter data from a given experimental condition of interest and predict the genes directly targeted by that perturbation.

Results: Our proposed SSEM–Lasso method demonstrated substantial improvement in sensitivity compared with other tested methods for predicting the targets of perturbations in both simulated datasets and microarray compendia. In simulated data, for two different network types, and over a wide range of signal-to-noise ratios, our algorithm demonstrated a 167% increase in sensitivity on average for the top 100 ranked genes, compared with the next best method. Our method also performed well in identifying targets of genetic perturbations in microarray compendia, with up to a 24% improvement in sensitivity on average for the top 100 ranked genes. The overall performance of our network-filtering method shows promise for identifying the direct targets of genetic dysregulation in cancer and disease from expression profiles.

Availability: Microarray data are available at the Many Microbe Microarrays Database (M3D, http://m3d.bu.edu). Algorithm scripts are available at the Gardner Lab website (http://gardnerlab.bu.edu/SSEMLasso).

Contact: kolaczyk@math.bu.edu

Supplementary information: Supplementary Data are available at Bioinformatics online.
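The two-step idea in the abstract (fit a sparse network by Lasso, then use it to filter a new expression profile) can be illustrated with a minimal sketch. This is not the authors' code, which is distributed at the Gardner Lab URL above. Everything below is an assumption made for illustration: the function names `lasso_cd`, `fit_network` and `rank_targets`, the penalty `alpha`, the toy 20-gene network, and the simple coordinate-descent Lasso solver, which stands in for whichever solver the paper used.

```python
import numpy as np

def lasso_cd(X, y, alpha, n_iter=100):
    """Coordinate-descent Lasso: minimise (1/2n)||y - Xb||^2 + alpha*||b||_1."""
    n, p = X.shape
    beta = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    resid = y.astype(float).copy()            # equals y - X @ beta while beta is 0
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            resid += X[:, j] * beta[j]        # partial residual without gene j
            rho = X[:, j] @ resid / n
            beta[j] = np.sign(rho) * max(abs(rho) - alpha, 0.0) / col_sq[j]
            resid -= X[:, j] * beta[j]        # restore the full residual
    return beta

def fit_network(X, alpha=0.02):
    """Regress each gene on all other genes; row i holds gene i's coefficients."""
    Xc = X - X.mean(axis=0)
    n_genes = Xc.shape[1]
    A_hat = np.zeros((n_genes, n_genes))
    for i in range(n_genes):
        others = np.arange(n_genes) != i      # exclude the gene itself
        A_hat[i, others] = lasso_cd(Xc[:, others], Xc[:, i], alpha)
    return A_hat

def rank_targets(A_hat, X_train, x_new):
    """Filter x_new through the network and rank genes by scaled residual."""
    mu = X_train.mean(axis=0)
    Xc = X_train - mu
    sigma = (Xc - Xc @ A_hat.T).std(axis=0)   # per-gene residual scale in training
    z = ((x_new - mu) - A_hat @ (x_new - mu)) / sigma
    return np.argsort(-np.abs(z)), z

# Toy data (assumed): gene i is driven by genes i-1 and i-3.
rng = np.random.default_rng(0)
n_genes, n_samples, target = 20, 300, 10
A = np.zeros((n_genes, n_genes))
for i in range(n_genes):
    if i >= 1:
        A[i, i - 1] = 0.5
    if i >= 3:
        A[i, i - 3] = -0.4
I = np.eye(n_genes)

# Steady state of x = A x + e, solved for every training sample.
X_train = np.linalg.solve(I - A, rng.normal(size=(n_samples, n_genes)).T).T

# New condition: a strong direct perturbation applied to one gene only.
u = np.zeros(n_genes)
u[target] = 15.0
x_new = np.linalg.solve(I - A, u + rng.normal(size=n_genes))

A_hat = fit_network(X_train)
ranked, z = rank_targets(A_hat, X_train, x_new)
print("top-ranked genes:", ranked[:5])
```

In this toy setting the perturbation propagates to many downstream genes, so raw expression changes alone would implicate genes that are only indirectly affected. Filtering through the fitted network removes the part of each gene's expression that the other genes explain, which is why the residual score concentrates on the directly perturbed gene.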