AutoPSI: a database for automatic structural classification of protein sequences and structures
Open Access
- 11 October 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (Database), D398-D401
- https://doi.org/10.1093/nar/gkm834
Abstract
In protein research, structural classifications of protein domains provided by databases such as SCOP play an important role. However, because such databases have to be curated and prepared carefully, they are updated only a few times per year, and in the meantime newly entered PDB structures cannot be used where a structural classification is required. The Automated Protein Structure Identification (AutoPSI) database delivers predicted SCOP classifications for several thousand as yet unclassified PDB entries as well as millions of UniProt sequences in an automated fashion. To obtain predictions, we use two recently published methods, AutoSCOP (sequence-based) and Vorolign (structure-based), and the consensus of both. With our predictions, we bridge the gap between SCOP versions for proteins with known structures in the PDB and additionally make structure predictions for a very large number of UniProt proteins. AutoPSI is freely accessible at http://www.bio.ifi.lmu.de/AutoPSIDB.
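The abstract does not say how the consensus of the two methods is formed. As a rough illustration only, the sketch below assumes each method returns a SCOP concise classification string of the form class.fold.superfamily.family (for example `a.1.1.2`) and treats the consensus as the deepest level on which both agree. The function name `scop_consensus` and this agreement rule are hypothetical and are not taken from AutoPSI.

```python
from typing import Optional


def scop_consensus(seq_pred: Optional[str], struct_pred: Optional[str]) -> Optional[str]:
    """Return the deepest SCOP level on which both predictions agree.

    Hypothetical rule for illustration: compare the dot-separated levels
    (class, fold, superfamily, family) from left to right and keep the
    shared prefix. Returns None if either prediction is missing or the
    two predictions already differ at the class level.
    """
    if not seq_pred or not struct_pred:
        return None
    common = []
    for seq_level, struct_level in zip(seq_pred.split("."), struct_pred.split(".")):
        if seq_level != struct_level:
            break
        common.append(seq_level)
    return ".".join(common) if common else None


# Hypothetical predictions from a sequence-based and a structure-based method
print(scop_consensus("a.1.1.2", "a.1.1.3"))
```

Under this assumed rule, two predictions that share a superfamily but differ in family yield a superfamily-level consensus, and predictions that disagree on the class yield no consensus at all.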
This publication has 12 references indexed in Scilit:
- AutoSCOP: automated prediction of SCOP classifications using unique pattern-class mappings. Bioinformatics, 2007
- Vorolign—fast structural alignment using Voronoi contacts. Bioinformatics, 2007
- The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Research, 2006
- The Universal Protein Resource (UniProt): an expanding universe of protein information. Nucleic Acids Research, 2006
- MODBASE: a database of annotated comparative protein structure models and associated resources. Nucleic Acids Research, 2006
- The SWISS-MODEL Repository: new features and functionalities. Nucleic Acids Research, 2006
- InterProScan: protein domains identifier. Nucleic Acids Research, 2005
- A large-scale analysis of mRNA polyadenylation of human and mouse genes. Nucleic Acids Research, 2005
- InterPro, progress and status in 2005. Nucleic Acids Research, 2004
- PDP: protein domain parser. Bioinformatics, 2003