Validating module network learning algorithms using simulated data
Open Access
- 3 May 2007
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 8 (S2), S5
- https://doi.org/10.1186/1471-2105-8-s2-s5
Abstract
In recent years, several authors have used probabilistic graphical models to learn expression modules and their regulatory programs from gene expression data. Despite the demonstrated success of such algorithms in uncovering biologically relevant regulatory relations, further developments in the area are hampered by a lack of tools to compare the performance of alternative module network learning strategies. Here, we demonstrate the use of the synthetic data generator SynTReN for the purpose of testing and comparing module network learning algorithms. We introduce a software package for learning module networks, called LeMoNe, which incorporates a novel strategy for learning regulatory programs. Novelties include the use of a bottom-up Bayesian hierarchical clustering to construct the regulatory programs, and the use of a conditional entropy measure to assign regulators to the regulation program nodes. Using SynTReN data, we test the performance of LeMoNe in a completely controlled situation and assess the effect of the methodological changes we made with respect to an existing software package, namely Genomica. Additionally, we assess the effect of various parameters, such as the size of the data set and the amount of noise, on the inference performance.