Structure-Templated Predictions of Novel Protein Interactions from Sequence Information
Open Access
- 21 September 2007
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 3 (9) , e182-9
- https://doi.org/10.1371/journal.pcbi.0030182
Abstract
The multitude of functions performed in the cell are largely controlled by a set of carefully orchestrated protein interactions often facilitated by specific binding of conserved domains in the interacting proteins. Interacting domains commonly exhibit distinct binding specificity to short and conserved recognition peptides called binding profiles. Although many conserved domains are known in nature, only a few have well-characterized binding profiles. Here, we describe a novel predictive method known as domain–motif interactions from structural topology (D-MIST) for elucidating the binding profiles of interacting domains. A set of domains and their corresponding binding profiles were derived from extant protein structures and protein interaction data and then used to predict novel protein interactions in yeast. A number of the predicted interactions were verified experimentally, including new interactions of the mitotic exit network, RNA polymerases, nucleotide metabolism enzymes, and the chaperone complex. These results demonstrate that new protein interactions can be predicted exclusively from sequence information. Many functions performed within a living cell are mediated by specific interactions between proteins. Precise geometric and chemical matches between segments of the protein structures facilitate those interactions. Such binding surfaces are often evolutionarily conserved elements of protein structures known as conserved domains that recognize specific binding elements on the interacting proteins. Binding domains and their corresponding interacting profiles constitute basic interacting modules that are replicated in multiple protein pairs, where they mediate similar interactions. Although many conserved domains are identified, only a handful have known, well-characterized binding elements. This paper describes a computational method that aims to elucidate the binding specificity of many domains. The utility of the derived binding specificity is demonstrated by predicting new interactions between yeast proteins. The predictions are based solely on sequence information by identifying the conserved domains and their corresponding binding sequences. A number of the predicted interactions were confirmed experimentally, demonstrating the feasibility of this approach.Keywords
This publication has 56 references indexed in Scilit:
- Towards a proteome-scale map of the human protein–protein interaction networkNature, 2005
- A Human Protein-Protein Interaction Network: A Resource for Annotating the ProteomePublished by Elsevier ,2005
- Inferring protein domain interactions from databases of interacting proteinsGenome Biology, 2005
- Inferring protein–protein interactions through high-throughput interaction data from diverse organismsBioinformatics, 2005
- Life sentences: Ontology recapitulates philology.Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Multisubunit RNA polymerasesCurrent Opinion in Structural Biology, 2002
- A Combined Experimental and Computational Strategy to Define Protein Interaction Networks for Peptide Recognition ModulesScience, 2002
- Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometryNature, 2002
- Functional organization of the yeast proteome by systematic analysis of protein complexesNature, 2002
- MINT: a Molecular INTeraction databaseFEBS Letters, 2001