MODBASE, a database of annotated comparative protein structure models

1 January 2002

journal article
research article
Published by Oxford University Press (OUP) in Nucleic Acids Research

Vol. 30 (1) , 255-259
https://doi.org/10.1093/nar/30.1.255

Abstract

MODBASE (http://guitar.rockefeller.edu/modbase) is a relational database of annotated comparative protein structure models for all available protein sequences matched to at least one known protein structure. The models are calculated by MODPIPE, an automated modeling pipeline that relies on PSI-BLAST, IMPALA and MODELLER. MODBASE uses the MySQL relational database management system for flexible and efficient querying, and the MODVIEW Netscape plugin for viewing and manipulating multiple sequences and structures. It is updated regularly to reflect the growth of the protein sequence and structure databases, as well as improvements in the software for calculating the models. For ease of access, MODBASE is organized into different datasets. The largest dataset contains models for domains in 304 517 out of 539 171 unique protein sequences in the complete TrEMBL database (23 March 2001); only models based on significant alignments (PSI-BLAST E-value < 10(-4)) and models assessed to have the correct fold are included. Other datasets include models for target selection and structure-based annotation by the New York Structural Genomics Research Consortium, models for prediction of genes in the Drosophila melanogaster genome, models for structure determination of several ribosomal particles and models calculated by the MODWEB comparative modeling web server.

Keywords

This publication has 47 references indexed in Scilit:

Ab Initio Protein Structure Prediction: Progress and Prospects
Annual Review of Biophysics, 2001
The CATH Dictionary of Homologous Superfamilies (DHS): a consensus approach for identifying distant structural homologues
Protein Engineering, Design and Selection, 2000
Expectations from structural genomics
Protein Science, 2000
The Protein Data Bank
Nucleic Acids Research, 2000
GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences
Journal of Molecular Biology, 1999
100,000 protein structures for the biologist
Nature Structural & Molecular Biology, 1998
Homology-based fold predictions for Mycoplasma genitalium proteins 1 1Edited by G. Von Heijne
Journal of Molecular Biology, 1998
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Nucleic Acids Research, 1997
Comparative Protein Modelling by Satisfaction of Spatial Restraints
Journal of Molecular Biology, 1993
Assessment of protein models with three-dimensional profiles
Nature, 1992