STRALCP: structure alignment-based clustering of proteins
Open Access
- 26 November 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (22) , e150
- https://doi.org/10.1093/nar/gkm1049
Abstract
Protein structural annotation and classification is an important and challenging problem in bioinformatics. Research towards analysis of sequence–structure correspondences is critical for better understanding of a protein's structure, function, and its interaction with other molecules. Clustering of protein domains based on their structural similarities provides valuable information for protein classification schemes. In this article, we attempt to determine whether structure information alone is sufficient to adequately classify protein structures. We present an algorithm that identifies regions of structural similarity within a given set of protein structures, and uses those regions for clustering. In our approach, called STRALCP (STRucture ALignment-based Clustering of Proteins), we generate detailed information about global and local similarities between pairs of protein structures, identify fragments (spans) that are structurally conserved among proteins, and use these spans to group the structures accordingly. We also provide a web server at http://as2ts.llnl.gov/AS2TS/STRALCP/ for selecting protein structures, calculating structurally conserved regions and performing automated clustering.
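The span-then-cluster idea in the abstract can be illustrated with a toy sketch. This is not the STRALCP implementation: the abstract does not give the actual similarity measures or clustering rule, so everything below is an assumption made for illustration. It assumes each structure has already been superimposed on a common reference and reduced to a per-residue distance list (None where no residue aligns). The 2.0 distance cutoff, the minimum span length of 3, the Jaccard overlap score, and the single-linkage grouping are likewise hypothetical choices, as are all function names.

```python
from itertools import combinations


def conserved_spans(flags, min_len=3):
    """Return inclusive (start, end) index pairs for runs of True with length >= min_len."""
    spans, start = [], None
    for i, ok in enumerate(flags):
        if ok and start is None:
            start = i                      # a run of conserved residues begins
        elif not ok and start is not None:
            if i - start >= min_len:       # keep only sufficiently long runs
                spans.append((start, i - 1))
            start = None
    if start is not None and len(flags) - start >= min_len:
        spans.append((start, len(flags) - 1))
    return spans


def span_signature(distances, cutoff=2.0, min_len=3):
    """Set of reference residue indices covered by conserved spans (assumed cutoff)."""
    flags = [d is not None and d <= cutoff for d in distances]
    covered = set()
    for start, end in conserved_spans(flags, min_len):
        covered.update(range(start, end + 1))
    return covered


def jaccard(a, b):
    """Overlap of two residue-index sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster(signatures, threshold=0.5):
    """Single-linkage grouping: link any two structures whose overlap >= threshold."""
    parent = {name: name for name in signatures}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for a, b in combinations(signatures, 2):
        if jaccard(signatures[a], signatures[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in signatures:
        groups.setdefault(find(name), set()).add(name)
    return sorted(sorted(g) for g in groups.values())


# Made-up per-residue distances to a 10-residue reference (illustrative only).
toy_distances = {
    "A": [0.5, 0.8, 1.0, 1.2, 5.0, 6.0, 0.9, 1.1, 1.0, None],
    "B": [0.6, 0.7, 1.1, 1.5, 4.0, None, 1.0, 0.8, 1.2, 7.0],
    "C": [6.0, 5.0, None, 4.5, 0.9, 1.0, 1.1, 8.0, 9.0, None],
}
toy_signatures = {name: span_signature(d) for name, d in toy_distances.items()}
toy_clusters = cluster(toy_signatures)
print(toy_clusters)
```

In this toy input, structures A and B share the same two conserved spans and are grouped together, while C is conserved only in a different region and forms its own group. A real pipeline would derive the per-residue similarities from actual pairwise structure alignments rather than hand-written numbers.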