MODBASE, a database of annotated comparative protein structure models
- 1 January 2002
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 30 (1) , 255-259
- https://doi.org/10.1093/nar/30.1.255
Abstract
MODBASE (http://guitar.rockefeller.edu/modbase) is a relational database of annotated comparative protein structure models for all available protein sequences matched to at least one known protein structure. The models are calculated by MODPIPE, an automated modeling pipeline that relies on PSI-BLAST, IMPALA and MODELLER. MODBASE uses the MySQL relational database management system for flexible and efficient querying, and the MODVIEW Netscape plugin for viewing and manipulating multiple sequences and structures. It is updated regularly to reflect the growth of the protein sequence and structure databases, as well as improvements in the software for calculating the models. For ease of access, MODBASE is organized into different datasets. The largest dataset contains models for domains in 304 517 out of 539 171 unique protein sequences in the complete TrEMBL database (23 March 2001); only models based on significant alignments (PSI-BLAST E-value < 10(-4)) and models assessed to have the correct fold are included. Other datasets include models for target selection and structure-based annotation by the New York Structural Genomics Research Consortium, models for prediction of genes in the Drosophila melanogaster genome, models for structure determination of several ribosomal particles and models calculated by the MODWEB comparative modeling web server.Keywords
This publication has 47 references indexed in Scilit:
- Ab Initio Protein Structure Prediction: Progress and ProspectsAnnual Review of Biophysics, 2001
- The CATH Dictionary of Homologous Superfamilies (DHS): a consensus approach for identifying distant structural homologuesProtein Engineering, Design and Selection, 2000
- Expectations from structural genomicsProtein Science, 2000
- The Protein Data BankNucleic Acids Research, 2000
- GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequencesJournal of Molecular Biology, 1999
- 100,000 protein structures for the biologistNature Structural & Molecular Biology, 1998
- Homology-based fold predictions for Mycoplasma genitalium proteins 1 1Edited by G. Von HeijneJournal of Molecular Biology, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Comparative Protein Modelling by Satisfaction of Spatial RestraintsJournal of Molecular Biology, 1993
- Assessment of protein models with three-dimensional profilesNature, 1992