An experimentally derived confidence score for binary protein-protein interactions

Top Cited Papers
Open Access
Abstract
Use of the protein-protein interaction reference sets reported in this issue in Venkatesan et al. to benchmark four complementary protein-protein interaction assays, followed by the training of a logistic regression model, allows the assignment of standardized confidence scores to individual protein-protein interactions. Information on protein-protein interactions is of central importance for many areas of biomedical research. At present no method exists to systematically and experimentally assess the quality of individual interactions reported in interaction mapping experiments. To provide a standardized confidence-scoring method that can be applied to tens of thousands of protein interactions, we have developed an interaction tool kit consisting of four complementary, high-throughput protein interaction assays. We benchmarked these assays against positive and random reference sets consisting of well documented pairs of interacting human proteins and randomly chosen protein pairs, respectively. A logistic regression model was trained using the data from these reference sets to combine the assay outputs and calculate the probability that any newly identified interaction pair is a true biophysical interaction once it has been tested in the tool kit. This general approach will allow a systematic and empirical assignment of confidence scores to all individual protein-protein interactions in interactome networks.