MODBASE: a database of annotated comparative protein structure models and associated resources
Open Access
- 1 January 2006
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (90001) , D291-D295
- https://doi.org/10.1093/nar/gkj059
Abstract
MODBASE (http://salilab.org/modbase) is a database of annotated comparative protein structure models for all available protein sequences that can be matched to at least one known protein structure. The models are calculated by MODPIPE, an automated modeling pipeline that relies on MODELLER for fold assignment, sequence-structure alignment, model building and model assessment (http:/salilab.org/modeller). MODBASE is updated regularly to reflect the growth in protein sequence and structure databases, and improvements in the software for calculating the models. MODBASE currently contains 3 094 524 reliable models for domains in 1 094 750 out of 1 817 889 unique protein sequences in the UniProt database (July 5, 2005); only models based on statistically significant alignments and models assessed to have the correct fold despite insignificant alignments are included. MODBASE also allows users to generate comparative models for proteins of interest with the automated modeling server MODWEB (http://salilab.org/modweb). Our other resources integrated with MODBASE include comprehensive databases of multiple protein structure alignments (DBAli, http://salilab.org/dbali), structurally defined ligand binding sites and structurally defined binary domain interfaces (PIBASE, http://salilab.org/pibase) as well as predictions of ligand binding sites, interactions between yeast proteins, and functional consequences of human nsSNPs (LS-SNP, http://salilab.org/LS-SNP).Keywords
This publication has 37 references indexed in Scilit:
- New York-Structural GenomiX Research Consortium (NYSGXRC): A Large Scale Center for the Protein Structure InitiativeJournal of Structural and Functional Genomics, 2005
- The Universal Protein Resource (UniProt)Nucleic Acids Research, 2004
- The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysisNucleic Acids Research, 2004
- Global analysis of protein expression in yeastNature, 2003
- Tools for comparative protein structure modeling and analysisNucleic Acids Research, 2003
- MAMMOTH (Matching molecular models obtained from theory): An automated method for model comparisonProtein Science, 2002
- The Protein Data Bank: unifying the archiveNucleic Acids Research, 2002
- Protein Structure Prediction and Structural GenomicsScience, 2001
- Structural genomics and its importance for gene function analysisNature Biotechnology, 2000
- A variable gap penalty function and feature weights for protein 3-D structure comparisonsProtein Engineering, Design and Selection, 1992