E-MSD: an integrated data resource for bioinformatics
Open Access
- 17 December 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 33 (Database), D262-D265
- https://doi.org/10.1093/nar/gki058
Abstract
The Macromolecular Structure Database (MSD) group (http://www.ebi.ac.uk/msd/) continues to enhance the quality and consistency of macromolecular structure data in the worldwide Protein Data Bank (wwPDB) and to work towards the integration of various bioinformatics data resources. One of the major obstacles to the improved integration of structural databases such as MSD and sequence databases such as UniProt is the absence of an up-to-date and well-maintained mapping between corresponding entries. We have worked closely with the UniProt group at the EBI to clean up the taxonomy and sequence cross-reference information in the MSD and UniProt databases. This information is vital for the reliable integration of sequence family databases such as Pfam and InterPro with the structure-oriented databases SCOP and CATH. It has been made available to the eFamily group (http://www.efamily.org.uk/) and now forms the basis of the regular interchange of information between the member databases (MSD, UniProt, Pfam, InterPro, SCOP and CATH). This exchange of annotation information has enriched the structural information in the MSD database with annotation from wider sequence-oriented resources. This work was carried out under the 'Structure Integration with Function, Taxonomy and Sequences (SIFTS)' initiative (http://www.ebi.ac.uk/msd-srv/docs/sifts) in the MSD group.