The InterPro Database, 2003 brings increased coverage and new features
- 1 January 2003
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (1), 315-318
- https://doi.org/10.1093/nar/gkg046
Abstract
InterPro, an integrated documentation resource of protein families, domains and functional sites, was created in 1999 as a means of amalgamating the major protein signature databases into one comprehensive resource. PROSITE, Pfam, PRINTS, ProDom, SMART and TIGRFAMs have been manually integrated and curated and are available in InterPro for text- and sequence-based searching. The results are provided in a single format that rationalises the results that would be obtained by searching the member databases individually. The latest release of InterPro contains 5629 entries describing 4280 families, 1239 domains, 95 repeats and 15 post-translational modifications. Currently, the combined signatures in InterPro cover more than 74% of all proteins in SWISS-PROT and TrEMBL, an increase of nearly 15% since the inception of InterPro. New features of the database include improved searching capabilities and enhanced graphical user interfaces for visualisation of the data. The database is available via a webserver (http://www.ebi.ac.uk/interpro) and anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro).