Reverse Engineering Molecular Regulatory Networks from Microarray Data with qp-Graphs
- 1 February 2009
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 16 (2) , 213-227
- https://doi.org/10.1089/cmb.2008.08tt
Abstract
Reverse engineering bioinformatic procedures applied to high-throughput experimental data have become instrumental in generating new hypotheses about molecular regulatory mechanisms. This has been particularly the case for gene expression microarray data, where a large number of statistical and computational methodologies have been developed in order to assist in building network models of transcriptional regulation. A major challenge faced by every different procedure is that the number of available samples n for estimating the network model is much smaller than the number of genes p forming the system under study. This compromises many of the assumptions on which the statistics of the methods rely, often leading to unstable performance figures. In this work, we apply a recently developed novel methodology based in the so-called q-order limited partial correlation graphs, qp-graphs, which is specifically tailored towards molecular network discovery from microarray expression data with p ≫ n. Using experimental and functional annotation data from Escherichia coli, here we show how qp-graphs yield more stable performance figures than other state-of-the-art methods when the ratio of genes to experiments exceeds one order of magnitude. More importantly, we also show that the better performance of the qp-graph method on such a gene-to-sample ratio has a decisive impact on the functional coherence of the reverse-engineered transcriptional regulatory modules and becomes crucial in such a challenging situation in order to enable the discovery of a network of reasonable confidence that includes a substantial number of genes relevant to the essayed conditions. An R package, called qpgraph implementing this method is part of the Bioconductor project and can be downloaded from www.bioconductor.org. A parallel standalone version for the most computationally expensive calculations is available from http://functionalgenomics.upf.xsedu/qpgraph.Keywords
This publication has 24 references indexed in Scilit:
- Many Microbe Microarrays Database: uniformly normalized Affymetrix compendia with structured experimental metadataNucleic Acids Research, 2007
- How to infer gene networks from expression profilesMolecular Systems Biology, 2007
- NCBI GEO: mining tens of millions of expression profiles--database and tools updateNucleic Acids Research, 2006
- Transcriptional regulation of the fad regulon genes of Escherichia coli by ArcAMicrobiology, 2006
- Reverse engineering of regulatory networks in human B cellsNature Genetics, 2005
- Discovery of meaningful associations in genomic data using partial correlation coefficientsBioinformatics, 2004
- Integrating high-throughput and computational data elucidates bacterial networksNature, 2004
- Network biology: understanding the cell's functional organizationNature Reviews Genetics, 2004
- Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networksProceedings of the National Academy of Sciences, 2000
- Establishing the Positive Definiteness of the Sample Covariance MatrixThe Annals of Mathematical Statistics, 1970