UniProt: the Universal Protein knowledgebase
Top Cited Papers
Open Access
- 1 January 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 32 (90001) , 115D-119
- https://doi.org/10.1093/nar/gkh131
Abstract
To provide the scientific community with a single, centralized, authoritative resource for protein sequences and functional information, the Swiss-Prot, TrEMBL and PIR protein database activities have united to form the Universal Protein Knowledgebase (UniProt) consortium. Our mission is to provide a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and query interfaces. The central database will have two sections, corresponding to the familiar Swiss-Prot (fully manually curated entries) and TrEMBL (enriched with automated classification, annotation and extensive cross-references). For convenient sequence searches, UniProt also provides several non-redundant sequence databases. The UniProt NREF (UniRef) databases provide representative subsets of the knowledgebase suitable for efficient searching. The comprehensive UniProt Archive (UniParc) is updated daily from many public source databases. The UniProt databases can be accessed online (http://www.uniprot.org) or downloaded in several formats (ftp://ftp.uniprot.org/pub). The scientific community is encouraged to submit data for inclusion in UniProt.Keywords
This publication has 23 references indexed in Scilit:
- PIRSF: family classification system at the Protein Information ResourceNucleic Acids Research, 2004
- Recent improvements to the PROSITE databaseNucleic Acids Research, 2004
- Protein family classification and functional annotationComputational Biology and Chemistry, 2003
- The Protein Data Bank and structural genomicsNucleic Acids Research, 2003
- PRINTS and its automatic supplement, prePRINTSNucleic Acids Research, 2003
- The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003Nucleic Acids Research, 2003
- TIGRFAMs: a protein family resource for the functional identification of proteinsNucleic Acids Research, 2001
- RefSeq and LocusLink: NCBI gene-centered resourcesNucleic Acids Research, 2001
- Gene Ontology: tool for the unification of biologyNature Genetics, 2000
- A novel method for automatic functional annotation of proteins.Bioinformatics, 1999