Motif-directed network component analysis for regulatory network inference
Open Access
- 13 February 2008
- journal article
- conference paper
- Published by Springer Nature in BMC Bioinformatics
- Vol. 9 (S1) , S21
- https://doi.org/10.1186/1471-2105-9-s1-s21
Abstract
Network Component Analysis (NCA) has shown its effectiveness in discovering regulators and inferring transcription factor activities (TFAs) when both microarray data and ChIP-on-chip data are available. However, a NCA scheme is not applicable to many biological studies due to limited topology information available, such as lack of ChIP-on-chip data. We propose a new approach, motif-directed NCA (mNCA), to integrate motif information and gene expression data to infer regulatory networks. We develop motif-directed NCA (mNCA) to incorporate motif information into NCA for regulatory network inference. While motif information is readily available from knowledge databases, it is a "noisy" source of network topology information consisting of many false positives. To overcome this problem, we develop a stability analysis procedure embedded in mNCA to resolve the inconsistency between motif information and gene expression data, and to enable the identification of stable TFAs. The mNCA approach has been applied to a time course microarray data set of muscle regeneration. The experimental results show that the inferred TFAs are not only numerically stable but also biologically relevant to muscle differentiation process. In particular, several inferred TFAs like those of MyoD, myogenin and YY1 are well supported by biological experiments. A novel computational approach, mNCA, has been developed to integrate motif information and gene expression data for regulatory network reconstruction. Specifically, motif analysis is used to obtain initial network topology, and stability analysis is developed and applied with mNCA to extract stable TFAs. Experimental results on muscle regeneration microarray data have demonstrated that mNCA is a practical and reliable computational method for regulatory network inference and pathway discovery.Keywords
This publication has 17 references indexed in Scilit:
- Latent Variable and nICA Modeling of Pathway Gene Module CompositePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- bioNMF: a versatile tool for non-negative matrix factorization in biologyBMC Bioinformatics, 2006
- Nuclear envelope dystrophies show a transcriptional fingerprint suggesting disruption of Rb–MyoD pathways in muscle regenerationBrain, 2006
- P-Match: transcription factor binding site search by combining patterns and weight matricesNucleic Acids Research, 2005
- Inferring yeast cell cycle regulators and interactions using transcription factor activitiesBMC Genomics, 2005
- MATCHTM: a tool for searching transcription factor binding sites in DNA sequencesNucleic Acids Research, 2003
- PromoSer: a large-scale mammalian promoter and transcription start site identification serviceNucleic Acids Research, 2003
- Module networks: identifying regulatory modules and their condition-specific regulators from gene expression dataNature Genetics, 2003
- From patterns to pathways: gene expression data analysis comes of ageNature Genetics, 2002
- Transcriptional Regulatory Networks in Saccharomyces cerevisiaeScience, 2002