The Pfam protein families database
Top Cited Papers
Open Access
- 29 November 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 40 (D1) , D290-D301
- https://doi.org/10.1093/nar/gkr1065
Abstract
Pfam is a widely used database of protein families, currently containing more than 13 000 manually curated protein families as of release 26.0. Pfam is available via servers in the UK (http://pfam.sanger.ac.uk/), the USA (http://pfam.janelia.org/) and Sweden (http://pfam.sbc.su.se/). Here, we report on changes that have occurred since our 2010 NAR paper (release 24.0). Over the last 2 years, we have generated 1840 new families and increased coverage of the UniProt Knowledgebase (UniProtKB) to nearly 80%. Notably, we have taken the step of opening up the annotation of our families to the Wikipedia community, by linking Pfam families to relevant Wikipedia pages and encouraging the Pfam and Wikipedia communities to improve and expand those pages. We continue to improve the Pfam website and add new visualizations, such as the ‘sunburst’ representation of taxonomic distribution of families. In this work we additionally address two topics that will be of particular interest to the Pfam community. First, we explain the definition and use of family-specific, manually curated gathering thresholds. Second, we discuss some of the features of domains of unknown function (also known as DUFs), which constitute a rapidly growing class of families within Pfam.Keywords
This publication has 30 references indexed in Scilit:
- Hidden Markov model speed heuristic and iterative HMM search procedureBMC Bioinformatics, 2010
- The crystal structure of Mtr4 reveals a novel arch domain required for rRNA processingThe EMBO Journal, 2010
- The Pfam protein families databaseNucleic Acids Research, 2009
- Ensembl Genomes: Extending Ensembl across the taxonomic spaceNucleic Acids Research, 2009
- UniRef: comprehensive and non-redundant UniProt reference clustersBioinformatics, 2007
- Purification, characterization, and molecular gene cloning of an antifungal protein from Ginkgo biloba seedsBiological Chemistry, 2007
- Pfam: clans, web tools and servicesNucleic Acids Research, 2006
- Anillin Is a Substrate of Anaphase-promoting Complex/Cyclosome (APC/C) That Controls Spatial Contractility of Myosin during Late CytokinesisJournal of Biological Chemistry, 2005
- Identification of an Escherichia coli Operon Required for Formation of the O-Antigen CapsuleJournal of Bacteriology, 2005
- Exhaustive Enumeration of Protein Domain FamiliesJournal of Molecular Biology, 2003