ASTRAL compendium enhancements
Open Access
- 1 January 2002
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 30 (1), 260-263
- https://doi.org/10.1093/nar/30.1.260
Abstract
The ASTRAL compendium provides several databases and tools to aid in the analysis of protein structures, particularly through the use of their sequences. It is partially derived from the SCOP database of protein domains, and it includes sequences for each domain as well as other resources useful for studying these sequences and domain structures. Several major improvements have been made to the ASTRAL compendium since its initial release 2 years ago. The number of protein domain sequences included has doubled from 15 190 to 30 867, and additional databases have been added. The Rapid Access Format (RAF) database contains manually curated mappings linking the biological amino acid sequences described in the SEQRES records of PDB entries to the amino acid sequences structurally observed (provided in the ATOM records) in a format designed for rapid access by automated tools. This information is used to derive sequences for protein domains in the SCOP database. In cases where a SCOP domain spans several protein chains, all of which can be traced back to a single genetic source, a 'genetic domain' sequence is created by concatenating the sequences of each chain in the order found in the original gene sequence. Both the original-style library of SCOP sequences and a new library including genetic domain sequences are available. Selected representative subsets of each of these libraries, based on multiple criteria and degrees of similarity, are also included. ASTRAL may be accessed at http://astral.stanford.edu/.
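The genetic domain construction described in the abstract can be illustrated with a minimal Python sketch. This is not the ASTRAL implementation or the RAF file format: the `Chain` class, its `gene_order` and `observed` fields, and the toy sequences are all hypothetical, and the choice to concatenate SEQRES-style sequences rather than ATOM-derived ones is an assumption made only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Chain:
    """One protein chain of a multi-chain domain (hypothetical structure)."""
    chain_id: str         # PDB chain identifier
    gene_order: int       # assumed: position of this chain in the source gene
    seqres: str           # biological sequence, as in SEQRES records
    observed: List[bool]  # True where the residue also appears in ATOM records


def observed_sequence(chain: Chain) -> str:
    """Return only the residues that were structurally observed."""
    if len(chain.seqres) != len(chain.observed):
        raise ValueError("seqres and observed must have the same length")
    return "".join(res for res, seen in zip(chain.seqres, chain.observed) if seen)


def genetic_domain_sequence(chains: List[Chain]) -> str:
    """Concatenate chain sequences in the order they occur in the gene."""
    ordered = sorted(chains, key=lambda c: c.gene_order)
    return "".join(c.seqres for c in ordered)


# Toy data: chain B precedes chain A in the (hypothetical) gene.
chains = [
    Chain("A", 2, "GAVL", [True, True, True, False]),
    Chain("B", 1, "MKT", [False, True, True]),
]

print(genetic_domain_sequence(chains))  # MKTGAVL
print(observed_sequence(chains[0]))     # GAV
```

Sorting on an explicit gene position, rather than on the chain identifier, reflects the point made in the abstract that chain order in the PDB entry need not match the order in the original gene sequence.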