Automated protein function prediction--the genomic challenge

Top Cited Papers

23 May 2006

journal article
review article
Published by Oxford University Press (OUP) in Briefings in Bioinformatics

Vol. 7 (3) , 225-242
https://doi.org/10.1093/bib/bbl004

Abstract

Overwhelmed with genomic data, biologists are facing the first big post-genomic question-what do all genes do? First, not only is the volume of pure sequence and structure data growing, but its diversity is growing as well, leading to a disproportionate growth in the number of uncharacterized gene products. Consequently, established methods of gene and protein annotation, such as homology-based transfer, are annotating less data and in many cases are amplifying existing erroneous annotation. Second, there is a need for a functional annotation which is standardized and machine readable so that function prediction programs could be incorporated into larger workflows. This is problematic due to the subjective and contextual definition of protein function. Third, there is a need to assess the quality of function predictors. Again, the subjectivity of the term 'function' and the various aspects of biological function make this a challenging effort. This article briefly outlines the history of automated protein function prediction and surveys the latest innovations in all three topics.

Keywords

This publication has 74 references indexed in Scilit:

Protein Molecular Function Prediction by Bayesian Phylogenomics
PLoS Computational Biology, 2005
Improving the Precision of the Structure–Function Relationship by Considering Phylogenetic Context
PLoS Computational Biology, 2005
Predicting Functional Gene Links from Phylogenetic-Statistical Analyses of Whole Genomes
PLoS Computational Biology, 2005
Inference of Protein Function from Protein Structure
Published by Elsevier ,2005
FAST: A novel protein structure alignment algorithm
Proteins-Structure Function and Bioinformatics, 2004
The PANTHER database of protein families, subfamilies, functions and pathways
Nucleic Acids Research, 2004
UniProt: the Universal Protein knowledgebase
Nucleic Acids Research, 2004
Automatic prediction of protein function
Cellular and Molecular Life Sciences, 2003
SNAPping up functionally related genes based on context information: a colinearity-free approach
Journal of Molecular Biology, 2001
Threading a database of protein cores
Proteins-Structure Function and Bioinformatics, 1995