Structure‐based function inference using protein family‐specific fingerprints
Open Access
- 1 June 2006
- journal article
- Published by Wiley in Protein Science
- Vol. 15 (6) , 1537-1543
- https://doi.org/10.1110/ps.062189906
Abstract
We describe a method to assign a protein structure to a functional family using family‐specific fingerprints. Fingerprints represent amino acid packing patterns that occur in most members of a family but are rare in the background, a nonredundant subset of PDB; their information is additional to sequence alignments, sequence patterns, structural superposition, and active‐site templates. Fingerprints were derived for 120 families in SCOP using Frequent Subgraph Mining. For a new structure, all occurrences of these family‐specific fingerprints may be found by a fast algorithm for subgraph isomorphism; the structure can then be assigned to a family with a confidence value derived from the number of fingerprints found and their distribution in background proteins. In validation experiments, we infer the function of new members added to SCOP families and we discriminate between structurally similar, but functionally divergent TIM barrel families. We then apply our method to predict function for several structural genomics proteins, including orphan structures. Some predictions have been corroborated by other computational methods and some validated by subsequent functional characterization.
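The assignment step described in the abstract (find fingerprint occurrences in a new structure, then score the count against background proteins) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' implementation: residue-contact graphs are encoded as a label dictionary plus an edge set, the matching is a brute-force labeled subgraph search rather than the fast algorithm the abstract mentions, and the confidence is a simple empirical fraction of background proteins that contain fewer fingerprints than the query. The abstract does not give the actual statistic, and all function names and example data below are hypothetical.

```python
from itertools import permutations


def make_graph(labels, edges):
    """Build a toy contact graph: node -> residue type, plus undirected edges."""
    return dict(labels), {frozenset(e) for e in edges}


def occurs_in(fingerprint, structure):
    """Brute-force labeled subgraph search (illustrative, not a fast algorithm).

    Returns True if the fingerprint's nodes can be mapped injectively onto
    structure nodes with identical residue labels so that every fingerprint
    edge is also an edge of the structure.
    """
    f_labels, f_edges = fingerprint
    s_labels, s_edges = structure
    f_nodes = list(f_labels)
    for candidate in permutations(s_labels, len(f_nodes)):
        mapping = dict(zip(f_nodes, candidate))
        if any(f_labels[n] != s_labels[mapping[n]] for n in f_nodes):
            continue
        if all(frozenset(mapping[n] for n in edge) in s_edges for edge in f_edges):
            return True
    return False


def count_fingerprints(fingerprints, structure):
    """Number of family fingerprints that occur in the structure."""
    return sum(occurs_in(fp, structure) for fp in fingerprints)


def empirical_confidence(query_count, background_counts):
    """Assumed scoring: fraction of background proteins with fewer fingerprints."""
    at_least = sum(c >= query_count for c in background_counts)
    return 1.0 - at_least / len(background_counts)


# Hypothetical family fingerprints: a His-Asp-Ser contact triangle and a Gly-Gly contact.
family_fingerprints = [
    make_graph({0: "H", 1: "D", 2: "S"}, [(0, 1), (1, 2), (0, 2)]),
    make_graph({0: "G", 1: "G"}, [(0, 1)]),
]

# Hypothetical query structure as a residue-contact graph.
query = make_graph(
    {"a": "H", "b": "D", "c": "S", "d": "G", "e": "A"},
    [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d")],
)

# Hypothetical fingerprint counts observed in ten background proteins.
background_counts = [0, 0, 1, 0, 2, 0, 0, 0, 0, 0]

found = count_fingerprints(family_fingerprints, query)
confidence = empirical_confidence(found, background_counts)
print(found, confidence)
```

The brute-force search grows factorially with fingerprint size, so it only suits toy graphs like these; a practical system would need an indexed or pruned subgraph isomorphism routine of the kind the abstract alludes to.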