STRING: a database of predicted functional associations between proteins
Top Cited Papers
- 1 January 2003
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (1) , 258-261
- https://doi.org/10.1093/nar/gkg034
Abstract
Functional links between proteins can often be inferred from genomic associations between the genes that encode them: groups of genes that are required for the same function tend to show similar species coverage, are often located in close proximity on the genome (in prokaryotes), and tend to be involved in gene-fusion events. The database STRING is a precomputed global resource for the exploration and analysis of these associations. Since the three types of evidence differ conceptually, and the number of predicted interactions is very large, it is essential to be able to assess and compare the significance of individual predictions. Thus, STRING contains a unique scoring-framework based on benchmarks of the different types of associations against a common reference set, integrated in a single confidence score per prediction. The graphical representation of the network of inferred, weighted protein interactions provides a high-level view of functional linkage, facilitating the analysis of modularity in biological processes. STRING is updated continuously, and currently contains 261 033 orthologs in 89 fully sequenced genomes. The database predicts functional interactions at an expected level of accuracy of at least 80% for more than half of the genes; it is online at http://www.bork.embl-heidelberg.de/STRING/.Keywords
This publication has 23 references indexed in Scilit:
- Lineage-specific loss and divergence of functionally linked genes in eukaryotesProceedings of the National Academy of Sciences, 2000
- STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a geneNucleic Acids Research, 2000
- Predicting Protein Function by Genomic Context: Quantitative Evaluation and Qualitative InferencesGenome Research, 2000
- Computational genetics: finding protein function by nonhomology methodsCurrent Opinion in Structural Biology, 2000
- Exploitation of gene contextCurrent Opinion in Structural Biology, 2000
- Who's your neighbor? New computational approaches for functional genomicsNature Biotechnology, 2000
- The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000Nucleic Acids Research, 2000
- Genome evolutionTrends in Genetics, 2000
- Protein interaction maps for complete genomes based on gene fusion eventsNature, 1999
- Detecting Protein Function and Protein-Protein Interactions from Genome SequencesScience, 1999