Connecting protein structure with predictions of regulatory sites
- 24 April 2007
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 104 (17) , 7068-7073
- https://doi.org/10.1073/pnas.0701356104
Abstract
A common task posed by microarray experiments is to infer the binding site preferences for a known transcription factor from a collection of genes that it regulates and to ascertain whether the factor acts alone or in a complex. The converse problem can also be posed: Given a collection of binding sites, can the regulatory factor or complex of factors be inferred? Both tasks are substantially facilitated by using relatively simple homology models for protein–DNA interactions, as well as the rapidly expanding protein structure database. For budding yeast, we are able to construct reliable structural models for 67 transcription factors and with them redetermine factor binding sites by using a Bayesian Gibbs sampling algorithm and an extensive protein localization data set. For 49 factors in common with a prior analysis of this data set (based largely on phylogenetic conservation), we find that half of the previously predicted binding motifs are in need of some revision. We also solve the inverse problem of ascertaining the factors from the binding sites by assigning a correct protein fold to 25 of the 49 cases from a previous study. Our approach is easily extended to other organisms, including higher eukaryotes. Our study highlights the utility of enlarging current structural genomics projects that exhaustively sample fold structure space to include all factors with significantly different DNA-binding specificities.Keywords
This publication has 52 references indexed in Scilit:
- Statistical mechanical modeling of genome-wide transcription factor occupancy data by MatrixREDUCEBioinformatics, 2006
- PhyloGibbs: A Gibbs Sampling Motif Finder That Incorporates PhylogenyPLoS Computational Biology, 2005
- Ab Initio Prediction of Transcription Factor Targets Using Structural KnowledgePLoS Computational Biology, 2005
- Improved detection of DNA motifs using a self-organized clustering of familial binding profilesBioinformatics, 2005
- Structural Alignment of Protein–DNA Interfaces: Insights into the Determinants of Binding SpecificityJournal of Molecular Biology, 2005
- Constrained Binding Site Diversity within Families of Transcription Factors Enhances Pattern Discovery BioinformaticsJournal of Molecular Biology, 2004
- Conserved homeodomain proteins interact with MADS box protein Mcm1 to restrict ECB-dependent transcription to the M/G1 phase of the cell cycleGenes & Development, 2002
- Genome-Wide Location and Function of DNA Binding ProteinsScience, 2000
- Protein-DNA interactions: a structural analysisJournal of Molecular Biology, 1999
- Crystal Structure of a bZIP/DNA Complex at 2.2 Å: Determinants of DNA Specific RecognitionJournal of Molecular Biology, 1995