Structure-based prediction of C2H2 zinc-finger binding specificity: sensitivity to docking geometry
Open Access
- 30 January 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (4), 1085-1097
- https://doi.org/10.1093/nar/gkl1155
Abstract
Predicting the binding specificity of transcription factors is a critical step in the characterization and computational identification of cis-regulatory elements in genomic sequences. Here we use protein–DNA structures to predict binding specificity and consider the possibility of predicting position weight matrices (PWMs) for an entire protein family based on the structures of just a few family members. A particular focus is the sensitivity of prediction accuracy to the docking geometry of the structure used. We investigate this issue with the goal of determining how similar two docking geometries must be for binding specificity predictions to be accurate. Docking similarity is quantified using our recently described interface alignment score (IAS). Using a molecular-mechanics force field, we predict high-affinity nucleotide sequences that bind to the second zinc-finger (ZF) domain from the Zif268 protein, using different C2H2 ZF domains as structural templates. We identify a strong relationship between IAS values and prediction accuracy, and define a range of IAS values for which accurate structure-based predictions of binding specificity are to be expected. The implication of our results for large-scale, structure-based prediction of PWMs is discussed.
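For readers unfamiliar with the term, a position weight matrix can be illustrated with a minimal sketch: given a set of equal-length binding sequences, it stores the frequency of each base at each position. The code below is a generic textbook construction, not the procedure used in the article; the function names `build_pwm` and `log_odds_score`, the pseudocount of 0.5, the uniform background of 0.25, and the toy triplets are all assumptions chosen for illustration.

```python
import math
from collections import Counter

BASES = "ACGT"


def build_pwm(sequences, pseudocount=0.5):
    """Return per-position base frequencies for equal-length DNA sequences.

    Hypothetical helper: the pseudocount value is an arbitrary choice.
    """
    if not sequences:
        raise ValueError("need at least one sequence")
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("all sequences must have the same length")

    total = len(sequences) + pseudocount * len(BASES)
    pwm = []
    for pos in range(length):
        # Count how often each base occurs at this position.
        counts = Counter(s[pos] for s in sequences)
        pwm.append({b: (counts[b] + pseudocount) / total for b in BASES})
    return pwm


def log_odds_score(pwm, site, background=0.25):
    """Sum of log2(frequency / background) over the positions of a site."""
    return sum(math.log2(pwm[i][base] / background)
               for i, base in enumerate(site))


# Toy input: made-up triplets, not data from the article.
toy_sites = ["TGG", "TGG", "TAG", "GGG"]
toy_pwm = build_pwm(toy_sites)
for position, column in enumerate(toy_pwm, start=1):
    print(position, {b: round(p, 3) for b, p in column.items()})
```

A site that resembles the input set receives a higher log-odds score than an unrelated one, which is how a PWM is typically used to scan genomic sequence for candidate binding sites.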