PASTAA: identifying transcription factors associated with sets of co-regulated genes
Open Access
- 9 December 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 25 (4) , 435-442
- https://doi.org/10.1093/bioinformatics/btn627
Abstract
Motivation: A major challenge in regulatory genomics is the identification of associations between functional categories of genes (e.g. tissues, metabolic pathways) and their regulating transcription factors (TFs). While the regulating TFs are already known for a limited number of categories, the responsible factors remain to be elucidated for many others.
Results: We put forward a novel method (PASTAA) for detecting transcription factors associated with functional categories, which utilizes the prediction of binding affinities of a TF to promoters. This binding strength information is compared to the likelihood of membership of the corresponding genes in the functional category under study. Coherence between the two ranked datasets is seen as an indicator of association between a TF and the category. PASTAA is applied primarily to the determination of TFs driving tissue-specific expression. We show that PASTAA is capable of recovering many TFs acting tissue specifically and, in addition, provides novel associations so far not detected by alternative methods. The application of PASTAA to detect TFs involved in the regulation of tissue-specific gene expression revealed a remarkable number of experimentally supported associations. The validated success for various datasets implies that PASTAA can directly be applied for the detection of TFs associated with newly derived gene sets.
Availability: The PASTAA source code as well as a corresponding web interface is freely available at http://trap.molgen.mpg.de
Contact: roider@molgen.mpg.de
Supplementary information: Supplementary data are available at Bioinformatics online.
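The Results paragraph of the abstract describes comparing two gene rankings: one by predicted TF binding affinity and one by likelihood of category membership. The sketch below illustrates one plausible way to score the coherence of two such rankings, using a hypergeometric test on the overlap of their top-ranked genes across several cutoffs. It is an assumption made for illustration, not the published PASTAA implementation; the function names (hypergeom_tail, rank_coherence), the toy scores, and the cutoff values are all hypothetical.

```python
from math import comb


def hypergeom_tail(k, N, K, n):
    """P(X >= k) when drawing n genes from N, of which K are 'marked'."""
    upper = min(K, n)
    total = comb(N, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / total


def rank_coherence(affinity, membership, cutoffs):
    """Smallest hypergeometric tail p-value over all pairs of top-list cutoffs.

    affinity, membership: dicts mapping gene name -> score (higher = stronger).
    cutoffs: sizes of the top lists to compare (hypothetical choice).
    """
    genes = list(affinity)
    by_affinity = sorted(genes, key=affinity.get, reverse=True)
    by_membership = sorted(genes, key=membership.get, reverse=True)
    best = 1.0
    for a in cutoffs:
        top_affinity = set(by_affinity[:a])
        for m in cutoffs:
            overlap = len(top_affinity & set(by_membership[:m]))
            # Chance of seeing at least this overlap if the rankings were unrelated.
            best = min(best, hypergeom_tail(overlap, len(genes), m, a))
    return best


# Toy data: eight genes with made-up scores.
genes = [f"g{i}" for i in range(1, 9)]
affinity = {g: 1.0 - 0.1 * i for i, g in enumerate(genes)}
coherent = dict(affinity)                                   # same ordering
reversed_ = {g: 0.1 * i for i, g in enumerate(genes)}       # opposite ordering

print(rank_coherence(affinity, coherent, [2, 4]))
print(rank_coherence(affinity, reversed_, [2, 4]))
```

A small value suggests that genes with high predicted affinity also tend to rank high for category membership. Because the minimum is taken over several cutoff pairs, it is not a calibrated p-value on its own and would need a multiple-testing or permutation-based correction before interpretation.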