InterPro: the integrative protein signature database
Top Cited Papers
Open Access
- 21 October 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 37 (Database) , D211-D215
- https://doi.org/10.1093/nar/gkn785
Abstract
The InterPro database (http://www.ebi.ac.uk/interpro/) integrates together predictive models or ‘signatures’ representing protein domains, families and functional sites from multiple, diverse source databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs. Integration is performed manually and approximately half of the total ∼58 000 signatures available in the source databases belong to an InterPro entry. Recently, we have started to also display the remaining un-integrated signatures via our web interface. Other developments include the provision of non-signature data, such as structural data, in new XML files on our FTP site, as well as the inclusion of matchless UniProtKB proteins in the existing match XML files. The web interface has been extended and now links out to the ADAN predicted protein–protein interaction database and the SPICE and Dasty viewers. The latest public release (v18.0) covers 79.8% of UniProtKB (v14.1) and consists of 16 549 entries. InterPro data may be accessed either via the web address above, via web services, by downloading files by anonymous FTP or by using the InterProScan search software (http://www.ebi.ac.uk/Tools/InterProScan/).Keywords
This publication has 26 references indexed in Scilit:
- Gene3D: comprehensive structural and functional annotation of genomesNucleic Acids Research, 2007
- The Universal Protein Resource (UniProt)Nucleic Acids Research, 2007
- Web Services at the European Bioinformatics InstituteNucleic Acids Research, 2007
- Curated genome annotation ofOryza sativassp.japonicaand comparative genome analysis withArabidopsis thalianaGenome Research, 2007
- New developments in the InterPro databaseNucleic Acids Research, 2007
- IntAct--open source resource for molecular interaction dataNucleic Acids Research, 2006
- PANTHER version 6: protein sequence and function evolution data with expanded representation of biological pathwaysNucleic Acids Research, 2006
- The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB dataNucleic Acids Research, 2006
- The SUPERFAMILY database in 2007: families and functionsNucleic Acids Research, 2006